#openocd on 2023-09-04 — irc logs at libera.irclog.whitequark.org

2021-05-27 14:41 NishanthMenon changed the topic of #openocd to: this is the place to discuss all things OpenOCD | Logs: https://libera.irclog.whitequark.org/openocd/

00:15 Hawk777 has quit [Quit: Leaving.]

02:04 jn has quit [Ping timeout: 260 seconds]

02:05 jn has joined #openocd

02:05 jn has quit [Changing host]

02:33 crabbedhaloablut has joined #openocd

04:07 olerem has quit [Quit: WeeChat 2.3]

04:10 olerem has joined #openocd

04:12 Hawk777 has joined #openocd

05:06 nerozero has joined #openocd

06:30 antto has quit [Quit: ZNC 1.8.2+deb3.1 - https://znc.in]

06:32 antto has joined #openocd

07:14 bvernoux has joined #openocd

07:31 <Xogium> borneoa___: yeah this one works. Ugh. Not really surprised there. I somehow keep hitting weird issues that noone's ever hit before

07:31 rkta has left #openocd [#openocd]

07:33 <olerem> Xogium: any software is usually implemented and tested for some specific use case. If it is works for any one else is an accident :D

07:33 <Xogium> olerem: yeah :) but I meant, I keep running into actual bugs sometimes, that noone's ever heard of before

07:34 <Xogium> like there was this bug where sometimes, when the conditions were right, that linux kernel could get out of its own infinite loop after a panic and continue running whatever remained of userspace on cpu0. And it is only by luck you would run into it, because preempt and smp

07:35 <Xogium> took us like 3 weeks to find it and squish it to death. This bug was positively ancient. We never found out what introduced it and exactly when, as it was present by the time of the dawn of git history

07:37 <olerem> Xogium: we had a bug where putting bootloader to other, actually proper, location would trigger a cache corruption

07:37 <Xogium> ow

07:37 <Xogium> that's mean

07:37 <Xogium> what happened ?

07:38 <olerem> the reason was - ARM speculative executation was speculation it self in to IO mem. L2 cached got read error and responded with trash to L1 cache so CPU was executing trash instructions

07:38 <Xogium> fwiw I increased the log level of u-boot all the way to 7 but all this does is making my board reset in a loop. My guess is u-boot is now crashing rather than hanging. Wonderful ! :D

07:39 <Xogium> ouch. That's very bad

07:40 <olerem> it was presend in all currently available boot loaders, becouse if it is executed somewehere in the middle of ram, then it would speculated most probably in to zeros, which is ok, except there RAM was not clean

07:40 <Xogium> dang

07:40 <Xogium> must have been difficult to debug

07:41 <olerem> very. i wrote some openocd scripts to dump l1 instruction and data caches

07:41 <Xogium> and I guess my bug whatever it is can qualify as heisenbug. Great. Boosting log level definitely altered the state by making it crash instead of hanging there. I have absolutely no idea what to do by now, besides downgrading the boot chain

07:43 gzlb has joined #openocd

07:45 <Xogium> I know a bunch of ways to debug something but super low level gdb and openocd is definitely not one of my strengths

08:10 slobodan has joined #openocd

08:10 Hawk777 has quit [Quit: Leaving.]

08:22 <Xogium> wouldn't it be dumb if all my problems come from dying micro sd storage haha

08:24 <olerem> Xogium: there can be different source of problems, including bad RAM tuning, wrong PMIC configuration, not fitting PLL configs. Cross tolking between internal componnentes, etc

08:30 <Xogium> olerem: yeah :/ I mean honestly all I did was to bump all versions of ATF, optee and u-boot to the new release by st

08:31 <Xogium> I didn't tweak configs or anything like that except for a different bootcmd to load my bootscr script, and telling it the env isn't to be found via the dt

08:31 <Xogium> but even with those fix not applied it fails the same way

08:32 <olerem> Xogium: i feel your pain

08:33 <Xogium> so I'm now very much confused

08:33 <Xogium> I know that upgrading each component individually isn't to be done but I did it anyway, just to learn what can happen

08:33 <Xogium> and also because I was curious

08:35 <Xogium> upgrading ATF alone makes it panic because of the optee header, upgrading optee alone makes the same thing happen but the other way around, upgrading u-boot alone makes the board bootloop

08:36 <Xogium> and of course upgrading all of them at once just makes it hang. And boosting the log level of u-boot in the hope to see something makes nothing but optee printing "I/TC: forced system reset"

08:37 <Xogium> I reckon I've got my back against the wall so to speak

08:37 Haohmaru has joined #openocd

08:37 <olerem> Xogium: are you sure, you load kernel to a working location?

08:37 <Xogium> I'm not even getting to kernel

08:37 <Xogium> I go from ATF to optee, then optee to u-boot... Which just basically hangs forever

08:38 <Xogium> with nothing printed on uart

08:40 <olerem> i work with some stm32mp1 based systems, which work fine so far, except i use mainline TF-A, no opetee for now and barebox instead of u-boot. but this information is probably no help for you

08:41 <Xogium> I would like to, but I rely on power management unfortunately

08:41 <Xogium> suspend to ram and all of those fancy things

08:42 <Xogium> but if I could have done without, I'd have sure gone full mainline

08:42 <Xogium> no offense to st of course they're brilliant. It's just, lets say that u-boot and I, we don't exactly enjoy each other's company ;)

08:43 <olerem> Xogium: my work is bringing things mainline. so - i need to use alwys mainline

08:43 <Xogium> of course yes, this makes sense :)

08:44 <Xogium> frnakly by now I'm tempted to just screw it and go back to older versions of softwares bu for kernel, hoping 6.1 kernel can be booted fine using older ATF and such. All of this started because of a bug I'm trying to get rid of, where the otg port on the dk2 sometimes chokes to death until you reboot the board

09:33 rkta has joined #openocd

10:13 a3f has joined #openocd

10:42 dormito has quit [Ping timeout: 246 seconds]

10:42 <a3f> I've an i.MX6Q (Cortex-A9) and I want to use OpenOCD v0.12.0 to set a memory watchpoint on an address. CPU is running bare-metal firmware in thumb mode. This works and on first access, the watchpoint is triggered. Now I remove the watchpoint, step a few instructions after the instruction which triggered the watchpoint, readd the watchpoint and resume, only to find the watchpoint trigger immediately again at the current instruction. This

10:42 <a3f> happens whether I am using the shell over telnet or gdb. Any ideas or experience with watchpoints on Cortex-A9?

10:44 dormito has joined #openocd

10:56 <olerem> borneoa___: may be you have more expiriance with this? ^

10:58 <olerem> zapb_: ^ any may be you too :)

11:03 <Haohmaru> baremetal on cortex-A hm..

11:04 <a3f> Haohmaru, debugging a suspected memory corruption in the bootloader. unfortunately kasan didn't spot it.

11:05 <Haohmaru> nah i was just thinking for one of my failed projects where i need muchos MHz

11:15 <Xogium> borneoa___, PaulFertser, bvernoux: hot damn. You won't believe it. My boot issues were partly due to me messing something up but not so much. I was half joking this morning when I said, wouldn't it be dumb if it was my storage failing ? It was. I reflashed the previously perfectly working boot chain and it was failing

11:16 <Xogium> I *HATE* micro sd

11:16 <olerem> a3f: there are some differences in the watchpoint control register of A7 and A9. See:

11:16 <olerem> https://developer.arm.com/documentation/ddi0464/e/Debug/Debug-register-descriptions/Watchpoint-Control-Registers

11:16 <olerem> https://developer.arm.com/documentation/ddi0388/i/debug/debug-register-descriptions/watchpoint-control-registers

11:16 <Xogium> as in, I really, seriously hate those things

11:17 <olerem> a3f: bits 13 - 5

11:17 <Xogium> I did what I could to prevent degradation, not that there's a lot to be made given the way micro sd work, but it failed within 2 years. Pathetic

11:18 <Xogium> the best I can do I guess is aim for slc micro sd... Ugh

11:18 <Haohmaru> put an HDD on it

11:18 <olerem> lol

11:18 <Xogium> hdd over usb 2 ? Huh huh

11:18 <Xogium> not sure ;)

11:19 <Haohmaru> CDROM

11:19 <olerem> will make you love micro SD :)

11:19 <Xogium> ^^

11:19 <Haohmaru> CompactDisc(tm)

11:19 <Xogium> I hate how micro sd failure is SO subtle you can't spot it easily 99% of the time

11:20 <olerem> Xogium: i would bet, none of lowlevel init things support any kind of error handling or reporting for sd/mmc controller

11:20 <Xogium> there was nothing that seemed off, really, but for the inconsistent behavior shown when you really dig into it. It used to work, it doesn't work anymore and you have no idea what happened and when it happened to be just over the edge

11:21 <Xogium> olerem: no, generally you have no way of knowing. You can do reverse engineering like arnd likes to do with *cough* kingston micro sd

11:22 merethan has joined #openocd

11:22 <Xogium> but aside from this, there's not many ways to see it coming. Even on so-called induistrial micro sd that have slc flash and claim to support smart, the info are minimal and often you need a specific program or at a minimum to sign an NDA to read how they encode the smart data, to be able to make your program to read them

11:23 <Haohmaru> HDD with S.M.A.R.T. enabled ;P~

11:23 <Haohmaru> wtf

11:24 <Xogium> which... Honestly, I have a stupid idea that might work

11:24 <olerem> Xogium: something what no one would implement in a boot loader anyway? :)

11:24 <Xogium> using the micro sd to eMMC adapter that came with my odroid c2 years ago to stuff this bad boy into the dk2

11:25 <Xogium> I'll lose like 90% of its hs400 speed but at least I'll have actual eMMC storage

11:27 <Xogium> olerem: yes there's also that. Noone will have health checking in a bootloader. But thing is, who knows when the health starts to fail, really ? When does it become too much and alarm bells should start going off ?

11:27 <Xogium> I've been dealing with boards that use micro sd for years and I still haven't found the answer to that

11:28 <olerem> Xogium: i love to implement diagnostics, but it is something hard to sell. Usuall answer is: We need more bells, not diags!!!

11:28 <Xogium> micro sd can fail in so various ways... Some very discretely like this one, where the failures accumulate day by day and sneak up on you and even then you struggle noticing it, some will on the other hand literally bit rot in your face. Yes I've seen that happening. Out of nowhere, on a perfectly fine running system, files were renamed and bit flipped all over

11:29 <olerem> Xogium: you know, some file systems have checksumming?

11:29 <Xogium> yep

11:30 <Xogium> but what do you do when the failure reaches the bootloader ? :/

11:30 <olerem> Xogium: and you are suing it?

11:30 <PaulFertser> Xogium: SD cards are awful. Probably unless you buy them from mouser/digikey.

11:30 <olerem> Xogium: checksumming bootloader and boot env still may tell you if something went wrong

11:31 <Xogium> PaulFertser: honest that's what I plan on doing. Even then they are notoriously bad storage medium, and I'd use eMMC over them any day. But mouser doesn't sell low end crap

11:31 <olerem> Xogium: can be bricked too :)

11:31 <olerem> s//eMMC

11:32 <Xogium> of course it can be bricked

11:32 <Xogium> but at least the eMMC standard has specification of health status :)

11:32 <olerem> ok. good point

11:33 <Xogium> and since they work differently you can also literally tell the chip inside to behave differently with one partiton and not another (i.e: enhanced user area)

11:34 <olerem> a3f: do you have your opinnion on eMMCs? :)

11:35 <Xogium> as for checksum of the bootloader, I'm actually unsure how to do it. I mean, I can check the sha256 of the fip.bin and such, but how do I look at the one burned on the device to compare ? Surely it isn't as easy as passing the partition where the fip has been written, because there might be some empty space left, so that will of course give a different sha256

11:36 <PaulFertser> Xogium: you can connect hardkernel eMMC boards with https://www.hardkernel.com/shop/emmc-module-reader-board-for-os-upgrade/ to a regular uSD slot.

11:36 * Haohmaru recently made a cortex-m4 bootloader with sha256 stuff

11:36 <Xogium> PaulFertser: yeah that's what I plan on doing for this one here. I only have the one eMMC and the one adapter, but I might grab a few more of them

11:37 rkta has left #openocd [#openocd]

11:39 <Xogium> this isn't perfect far from it... But... Ugh

11:40 <Xogium> at least with eMMC when the health status changes you can check on it

11:40 <Xogium> you can even make it so you get notified of that

11:40 <olerem> Xogium:how?

11:40 <Xogium> olerem: mmc-utils :)

11:41 <Xogium> it has a command to read the registers which can expose the health status of the eMMC

11:41 <Xogium> among other things

11:41 <Xogium> so wrapping some script around this can definitely be useful

11:42 <olerem> Xogium: ack, so there is still not central diagnostic infrastructure to nitify user or some other component about prefail conditions of some part

11:42 <Xogium> unfortunately not. Technically it shouldn't be difficult to make since we've already got the health status info, just that noone'd done it

11:43 <olerem> this kind of project is complicated enought to not do it a hobby time...

11:44 <Xogium> something that runs in the background and probes at the status info, warning you if it changes in a pretty form... I don't think it'd be too difficult to make

11:44 <Xogium> but yeah

11:44 <Haohmaru> when corruption is detected - launch the matrix screensaver

11:44 <Xogium> :D

11:45 <Xogium> if you still can !

11:45 <olerem> init a self destruction of the nuce power plant

11:45 <Haohmaru> you have to have faith

11:45 <Xogium> now, where the hell did I put that 64 gb eMMC

11:45 <Xogium> ;)

11:46 <Xogium> I might even be nice and enable the enhanced user area so it gets a nice 32 gb of pseudo slc flash

11:46 <Xogium> approximately 32 gb mind you

11:47 <Xogium> cause honestly, slc micro sd is pricy... I've got a 512 mb one in another board for logging purpose and that was like 30 euros or something

11:48 <Haohmaru> what's slc even

11:48 <Xogium> single level cell

11:49 <Xogium> i.e: instead of storing 2 bits per cell like mlc, or 3 like tlc... It stores one

11:49 <a3f> olerem, my opinion is don't run your system off a SD-Card

11:49 <Xogium> a3f: I agree ^^

11:50 <Xogium> slc makes the cells last longer because they don't wear out as quickly since they store just one bit per cell

11:50 <a3f> olerem, oh, thanks for the pointer! I will try to see if I can find their use sites in Linux and compare what openocd might be missing

11:51 <Xogium> with the disadvantage that it is often slower and of course reduced capacity

11:52 <Xogium> real slc I've seen can go up to max 2 gb afair. After that you jump straight to pslc which is basically software emulating how slc would behave and intentionally making it so data is stored one bit per cell

11:52 <Haohmaru> huh

11:54 <Xogium> some areas of a ssd are slc, the cache is one iirc. But the rest is basically all mlc at a minimum. The only ones that can have slc all the way are micro sd and eMMC afaik

11:54 <Xogium> bootareas of eMMC that are mlc are also often emulated slc

11:55 <Xogium> oh yes and of course the place where your ssd stores its firmware

11:55 <Haohmaru> FRAM ftw

11:56 <Xogium> I learned most of that chatting with an awesome person who worked on systems that were to go all the way up to the ISS

11:56 <Xogium> :D

11:57 <Haohmaru> i could've told you that flash memory is "meh" years ago if you asked me ;P~

11:57 <Xogium> Haohmaru: of course it is

11:58 <Xogium> but all those types, slc, mlc, etc. It's still nice to know how they work, I find

11:58 <Haohmaru> and i don't understand the madness with these SSDs, they are flash too, right?

11:58 <Xogium> worst of them all being qlc... God I hope they don't make one where they can store 5 bits per cell, 4 is already horrible

11:59 <a3f> olerem, they seem to just have repurposed unused bits for hypervisor support. Linux doesn't treat the register differently between A7 and A9 (cf. encode_ctrl_reg())

11:59 <Xogium> Haohmaru: yep they are

11:59 <Haohmaru> i'm staying on HDDs then ;P~

12:00 <Xogium> though, in all fairness to ssd, they seem to have way better firmware and flash quality than micro sd ;) not to mention they have smart data

12:00 <Xogium> mine here is showing 10% worn out after about 3 years

12:01 <Xogium> and that's with heavy compilation and stuff going on almost daily

12:08 <olerem> a3f: cortex a9 is ARM_DEBUG_ARCH_V7 (or ARM_DEBUG_ARCH_V7_ECP14?) and cortex a7 is ARM_DEBUG_ARCH_V7_1, if i see it correctly

12:17 <a3f> olerem, You're right. Cortex-A9 is indeed ARM_DEBUG_ARCH_V7_ECP14 and Cortex-A7 is ARM_DEBUG_ARCH_V7_1...

12:17 <a3f> I thought Cortex-A7 being ARMv7.0 means debug arch is v7.0 too...

12:45 slobodan has quit [Read error: Connection reset by peer]

12:46 slobodan has joined #openocd

13:03 slobodan has quit [Remote host closed the connection]

13:04 slobodan has joined #openocd

13:51 sam has joined #openocd

13:51 confused123 has joined #openocd

14:03 <sam> Hi everyone, i am looking for some help with an issue i have encountered. i have 2 TAPs declared on a jtag chain. one of which is an ARM DAP (cortex-m) and the other a simple TAP. when we declare the ARM DAP as a cortex-m target we can communicate with the ARM just fine but when i try to access the other TAP with a drscan command i get the

14:03 <sam> following error and openocd exits:

14:03 <sam> Debug: 38 28720 command.c:155 script_debug(): command - irscan mo.tap_1 0x1

14:03 <sam> Debug: 39 28750 command.c:155 script_debug(): command - drscan mo.tap_1 32 0x0

14:03 <sam> Assertion failed: active == tap, file /__w/openocd-xpack/openocd-xpack/build/win32-x64/sources/openocd.git/src/jtag/drivers/driver.c, line 141

14:03 <sam> Debug: 40 28751 server.c:607 sig_handler(): Terminating on Signal 22

14:03 <sam> but when i declare the DAP as a mem_ap target, i can access the other TAP just fine. Does anyone have an idea why this error happens in the first case?

14:16 confused123 has quit [Quit: Client closed]

14:26 <PaulFertser> sam: probably you didn't disable polling?

14:27 <PaulFertser> sam: with a target tap defined it's probably polled all the time. And for drscan to work you need IR regiser of the tap to have some specific value. But if the target is polled the other taps will be switched to BYPASS.

14:29 <sam> Hi Paul, i indeed have not.. i was wondering if its something like that, but i was not sure how to do that..

14:31 <sam> so i just have to use "poll off" when i want to communicate with the TAP and "poll on" when switching back to the DAP?

14:32 <sam> yes, that indeed works! many thanks PaulFertser!

14:32 <PaulFertser> sam: that's my guess, es.

14:32 <PaulFertser> sam: happy to hear! :)

14:48 sam has quit [Quit: Client closed]

15:44 Chris__ has quit [Ping timeout: 245 seconds]

15:45 wingsorc has quit [Ping timeout: 246 seconds]

16:27 zkrx has quit [Ping timeout: 260 seconds]

17:11 Haohmaru has quit [Ping timeout: 240 seconds]

17:25 merethan has quit [Ping timeout: 244 seconds]

18:37 crabbedhaloablut has quit []

19:04 nerozero has quit [Ping timeout: 240 seconds]

19:47 crabbedhaloablut has joined #openocd

19:57 crabbedhaloablut has quit []

20:07 josuah has quit [Quit: josuah]

20:07 josuah has joined #openocd

20:11 uartist7 has joined #openocd

20:11 rkta_ has joined #openocd

20:11 cozycactus_ has joined #openocd

20:12 cyrozap_ has joined #openocd

20:12 JakeSays_ has joined #openocd

20:12 JakeSays has quit [Read error: Connection reset by peer]

20:12 silurian_invader has quit [Ping timeout: 246 seconds]

20:12 xantoz has quit [Ping timeout: 246 seconds]

20:12 cozycactus has quit [Ping timeout: 246 seconds]

20:12 zmatt has quit [Ping timeout: 246 seconds]

20:12 cyrozap has quit [Ping timeout: 246 seconds]

20:12 uartist has quit [Ping timeout: 246 seconds]

20:12 cozycactus_ is now known as cozycactus

20:12 uartist7 is now known as uartist

20:12 xantoz has joined #openocd

20:13 bryanb has quit [Read error: Connection reset by peer]

20:13 lh has quit [Write error: Connection reset by peer]

20:14 lh has joined #openocd

20:14 bryanb has joined #openocd

20:19 zmatt has joined #openocd

20:26 silurian_invader has joined #openocd

20:54 bvernoux has quit [Quit: Leaving]

21:25 slobodan has quit [Ping timeout: 255 seconds]

21:46 tsal has quit [Ping timeout: 246 seconds]

21:47 marex has quit [Ping timeout: 246 seconds]

21:49 marex has joined #openocd

21:53 tsal has joined #openocd

23:36 Hawk777 has joined #openocd