#rust-embedded on 2023-12-10 — irc logs at libera.irclog.whitequark.org

2022-02-07 19:20 ChanServ changed the topic of #rust-embedded to: Welcome to the Rust Embedded IRC channel! Bridged to #rust-embedded:matrix.org and logged at https://libera.irclog.whitequark.org/rust-embedded, code of conduct at https://www.rust-lang.org/conduct.html

00:20 <thejpster[m]> That book is probably due an update, if anyone feels like they have the time.

00:26 <vollbrecht[m]> keeping the books up-to-date is hard work. rust on embedded is just moving to fast xD

01:00 Guest7282 has joined #rust-embedded

02:41 starblue has quit [Ping timeout: 256 seconds]

02:43 starblue has joined #rust-embedded

03:40 Guest38 has joined #rust-embedded

03:40 <Guest38> nom is such a cool library

03:42 <Guest38> I just discovered it. I am using it for advent of code rn.

03:50 Guest38 has quit [Quit: Client closed]

04:02 pbsds has quit [Quit: The Lounge - https://thelounge.chat]

04:03 pbsds has joined #rust-embedded

04:09 pbsds has quit [Quit: The Lounge - https://thelounge.chat]

04:11 pbsds has joined #rust-embedded

04:27 Guest7282 has left #rust-embedded [Error from remote client]

05:16 pbsds has quit [Quit: The Lounge - https://thelounge.chat]

05:19 pbsds has joined #rust-embedded

05:21 emerent_ has joined #rust-embedded

05:21 emerent has quit [Killed (platinum.libera.chat (Nickname regained by services))]

05:21 emerent_ is now known as emerent

06:25 <bartmassey[m]> Henk and I are thinking about an update to Rust Disco Book for MB2. Expect a bunch of stuff from us once Winter Break has passed.

06:26 <bartmassey[m]> Belated thanks to everyone for the DTS/SVD info. I think I can see my way to maybe doing something with this fancy RISCV-64 chip on the Milk-V someday when I have time.

06:27 <M9names[m]> <thejpster[m]> "That book is probably due an..." <- it's likely also due for a review and discussion on what the content of the book should look like 2024.

06:27 <M9names[m]> > This project is developed and maintained by the Resources team.

06:27 <M9names[m]> does this mean that someone would have to be on the Resources team to do this update?

06:27 <bartmassey[m]> https://github.com/BartMassey/libaoc.git if you like. I used to use it back when I was doing AoC in Rust. It's… some code I used.

06:28 <bartmassey[m]> (and wrote, to be clear)

06:29 <bartmassey[m]> It has a fancy parser derive macro thing for input. But I'm off-topic, apologies

06:33 <bartmassey[m]> On-topic, I've started working with Henk on Embedded Rust Course Notes, and it has turned into an mdbook. Will make public shortly after break when it's not a barely-started eldritch abomination anymore.

06:33 <bartmassey[m]> I think my strat is to get that in shape, then see about the MB2 Disco Book and see what can be moved in there.

07:11 Guest7282 has joined #rust-embedded

07:13 Guest7282 has left #rust-embedded [Error from remote client]

07:19 crabbedhaloablut has joined #rust-embedded

08:23 <thejpster[m]> <M9names[m]> "it's likely also due for a..." <- > <@9names:matrix.org> it's likely also due for a review and discussion on what the content of the book should look like 2024.... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/KkZZrssGqXbjEGUmCayfTUuW>)

08:57 marmrt[m] has quit [Quit: Idle timeout reached: 172800s]

09:21 djdisodo[m] has joined #rust-embedded

09:21 <djdisodo[m]> i'm dealing with tiny ram(2048 bytes) on arduino uno, also as far as i know .data section loads up to SRAM before main starts

09:21 <djdisodo[m]> and i found that .data section is full of strings that are probably for error messages, but i think these values should be constant, why are these loaded onto sram?

09:23 <djdisodo[m]> that is quite painful since this kind of strings are using 1037 bytes worth of space now

09:25 <djdisodo[m]> not all error message live in .data and some live in .LOAD tho

09:25 <djdisodo[m]> * not all error message live in .data and some live in LOAD(code space) tho

09:30 * djdisodo[m] uploaded an image: (90KiB) < https://catircservices.org/_matrix/media/v3/download/matrix.org/wtHTqhzDOpfVfdMxhyufoVmh/image.png >

09:36 <M9names[m]> AVR is a harvard architecture. code and data address spaces are separate.

09:36 <M9names[m]> for data (like strings) to be usable they have to be in RAM. that's just how it goes.

09:37 <djdisodo[m]> M9names[m]: > <@9names:matrix.org> AVR is a harvard architecture. code and data address spaces are separate.

09:37 <djdisodo[m]> > for data (like strings) to be usable they have to be in RAM. that's just how it goes.

09:37 <djdisodo[m]> i think they can load right before use rather than loading them before main entry

09:38 <M9names[m]> in theory, yes, that is possible, and the C SDK has a bunch of machinery to make that easier but it's still a manual process to store a string in PROGMEM and a special version of printf that will do the copy first

09:40 <djdisodo[m]> M9names[m]: some of strings are in LOAD space so i think it's just that compiler decides

09:40 <djdisodo[m]> is there any way to make compiler prefer PROGMEM

09:45 <M9names[m]> i don't think so. how would it know what should live there and what should not?

09:46 <djdisodo[m]> M9names[m]: if it's const or static, it's read only so put in progmem

09:46 <djdisodo[m]> if it's static mut put in ram

09:47 <M9names[m]> if it's const it's put directly where it's used.

09:47 <djdisodo[m]> yes

09:48 <djdisodo[m]> but i really think that strings are const, why would they be anything else?

10:01 <M9names[m]> <djdisodo[m]> "if it's const or static, it's..." <- the problem is that both the C and Rust memory model assume that all memory is addressable. that isn't true for you, PROGMEM is on a completely separate bus from your RAM.

10:01 <M9names[m]> so if the compiler automatically put things in PROGMEM you would immediately run into hard to debug errors, because your code would have pointers that don't actually point to your data.

10:01 <M9names[m]> also those pointers to strings also have to live in RAM to be used even though they are const - same reason as above.

10:05 <djdisodo[m]> ok

10:05 <M9names[m]> those "error messages" in your screenshots are panic messages. i don't think there's any easy way to tell them where to live.

10:05 <M9names[m]> you can get LTO to remove them if you use panic-immediate-abort

10:06 <djdisodo[m]> oh

10:07 <M9names[m]> you can find more avr-specific help in #avr-rust_Lobby:gitter.im. it's a pretty small community though, so answers might take a while

10:08 <djdisodo[m]> tha sounds great tho maybe i should use simavr with mcu that has bigger ram when i need to enable panic message when i need to debug

10:09 Guest7282 has joined #rust-embedded

10:23 <djdisodo[m]> rando thought but if i put #[inline(always)] on panic handler won't string be inlined as well?

10:33 IlPalazzo-ojiisa has joined #rust-embedded

11:51 <djdisodo[m]> <M9names[m]> "those "error messages" in your..." <- > <@9names:matrix.org> those "error messages" in your screenshots are panic messages. i don't think there's any easy way to tell them where to live.

11:51 <djdisodo[m]> > you can get LTO to remove them if you use panic-immediate-abort

11:51 <djdisodo[m]> i used panic_halt and it eliminated most of them but there's still some which is from .expect("");

12:20 <JamesMunns[m]> <M9names[m]> "it's likely also due for a..." <- > <@9names:matrix.org> it's likely also due for a review and discussion on what the content of the book should look like 2024.... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/WqjUXsYgmsGuKbmvYaqfnlKq>)

12:53 Guest7282 has left #rust-embedded [Error from remote client]

13:05 <JamesMunns[m]> In general, if someone:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/mEjqRlVbqkFwOOrcXBzlJIhV>)

13:05 <thejpster[m]> <M9names[m]> "the problem is that both the C..." <- > <@9names:matrix.org> the problem is that both the C and Rust memory model assume that all memory is addressable. that isn't true for you, PROGMEM is on a completely separate bus from your RAM.... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/LbMgtiqMUvRGakmIfytDNosn>)

13:14 Guest7282 has joined #rust-embedded

13:35 Noah[m]1 has joined #rust-embedded

13:35 <Noah[m]1> does anyone know if I can const transform (no runtime code) a `&'static str` into a `[u8; 128]`? I realize that the sizes might not match. in that case I would like to add this just the length the str has, but then still make the array in the struct that size.

13:35 <JamesMunns[m]> `[u8; 128]` or `&[u8; 128]`?

13:36 <JamesMunns[m]> and where are you getting the str from? `include_str!()`? Noah

13:38 <Noah[m]1> JamesMunns[m]: nope, array, not slice, so `[u8; 128]`

13:38 <Noah[m]1> JamesMunns[m]: from a string literal :)

13:38 <Noah[m]1> Specifically here: https://github.com/probe-rs/flash-algorithm/blob/master/src/lib.rs#L202C21-L202C21

13:39 <Noah[m]1> I mean we can just put all zeroes which is fine, but a string would be nicer

13:39 <Noah[m]1> and ofc you can provide an array but that kinda also sucks

13:39 <JamesMunns[m]> so you want:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/ScfkeYYmWaWzlezCeBzgysgQ>)

13:41 <Noah[m]1> JamesMunns[m]: > <@jamesmunns:beeper.com> so you want:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/xjORoeDQSEbALTwIxmJWzzig>)

13:41 <JamesMunns[m]> it is, typing now

13:41 <Noah[m]1> uii neato! :)

13:41 Lumpio[m] has joined #rust-embedded

13:41 <Lumpio[m]> Noah[m]1: fwiw the later is a reference to an array, not a slice

13:42 <Noah[m]1> Lumpio[m]: ah yep, I am stupid, absolutely! :)

13:45 <JamesMunns[m]> playground is on the struggle bus, but:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/vprhVqtvyrmRAFvlxVqIiXGr>)

13:46 <JamesMunns[m]> be careful tho, this doesn't handle utf-8 really, in that it'll truncate AT 128 bytes, which might not be a valid utf-8 boundary

13:46 <Noah[m]1> JamesMunns[m]: > <@jamesmunns:beeper.com> playground is on the struggle bus, but:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/iEQfkqPRYBDYuMUDFWYfaapS>)

13:46 <Noah[m]1> thanks so much man! I will give it a go :)

13:47 <Lumpio[m]> const fns can do surprising things already but it feels pretty... manual

13:47 <Noah[m]1> yeah, I think it's fair to let the user know in the docs. worst case the string that is displayed is crooked.

13:47 <JamesMunns[m]> so, probably you need to decide:

13:47 <JamesMunns[m]> * do you stick with utf-8, and truncate early correctly

13:47 <JamesMunns[m]> * do you stick with ASCII/c-strings, ensure everything is an ascii character (and replace non ascii chars), and ensure there's a zero terminator or something

13:47 <Lumpio[m]> We have copy_from_slice but nooo it's not const implement memcpy by hand instead

13:47 <Noah[m]1> I think the spec might suggest only ASCII is allowed since it tailors to C

13:48 <JamesMunns[m]> There's a fun double trick you can do for turning include_bytes or include_str into arrays with compile time decided len, too

13:49 <Noah[m]1> uii

13:49 <Noah[m]1> yeah, I mean the flash algo api is up for debate :)

13:49 <Noah[m]1> but so far it made a nice improession :)

13:49 <Lumpio[m]> Noah @yatekii:matrix.org: Why not do a year of ruet adventures as a job

13:49 <Lumpio[m]> s/ruet/rust/

13:51 * JamesMunns[m] sent a rust code block: https://catircservices.org/_matrix/media/v3/download/catircservices.org/BNGhDQRHoHnYviFIahnEJood

13:52 <Lumpio[m]> galaxy brain

13:52 <JamesMunns[m]> This is useful because if you want to place the array in a specific linker section, you can

13:53 <JamesMunns[m]> (whereas `&[u8]`/`&str` cannot be)

13:53 <JamesMunns[m]> oh, for that it probably needs to be static TOML_ARR not const TOML_ARR, but works the same

13:54 <JamesMunns[m]> This is pretty useful if you have some sort of header or metadata you can calculate in a build-rs, drop it to the filesystem, then include_bytes it into a specific flash location

13:55 <JamesMunns[m]> useful for things like git hashes, tho not for CRC32 and such, as build-rs runs BEFORE compiling the binary.

13:55 <Lumpio[m]> post_build.rs when

13:56 <Lumpio[m]> ...why does Element mobile think that is a domain name, it's not even valid

13:56 <JamesMunns[m]> Lumpio[m]: doesn't help if you need the data AT build time :D

13:56 <JamesMunns[m]> yeah, that's why I type build-rs lol

13:58 <Lumpio[m]> A post_build.rs could maybe edit your binary afterwards to add a crc or whatnot

13:58 <vollbrecht[m]> <JamesMunns[m]> "const TOML_BYTES: &[u8] = includ..." <- > <@jamesmunns:beeper.com> ```rust... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/kZzKjnGXCuaTFPWTbDebOvFe>)

14:11 <adamgreig[m]> For the short string can you just have something like `static X: [u8; 12] = *b"hello world";`?

14:12 <adamgreig[m]> Not sure if there's an easy way to pad it out without writing the 0s though, I usually handled that separately when needed

14:16 <JamesMunns[m]> Yeah, my tricks are specifically for when you are using a variably sized payload. Agreed you can do that if you know the existing size

14:16 <JamesMunns[m]> no way to zero-fill tho, afaik

14:17 <JamesMunns[m]> (other than using a const-fn, like I did)

15:01 <Noah[m]1> <Lumpio[m]> "Noah @yatekii:matrix.org: Why..." <- if you wanna pay for it :)

15:01 <Noah[m]1> <JamesMunns[m]> "This is useful because if you..." <- hmm I might need that, I am not sure

15:05 <Noah[m]1> <adamgreig[m]> "For the short string can you..." <- ah lol true! but how would you zero pad that? :)

15:05 <Noah[m]1> ah james already mentioned it :)

15:05 <Noah[m]1> ofc :D

15:05 <Noah[m]1> everybody giving me hard imposter syndrome

15:08 <adamgreig[m]> Depends why it needs padding, could just manually write some extra \x00 or otherwise I'll often do the padding in firmware if it's just like "need to transmit 128 bytes starting with this string from flash", I'll transmit string then 128-len 0s or whatever. If you need a mutable buffer pre-init with some string at the start and the rest 0s it doesn't really work, james' const fn is best then I think

15:13 IlPalazzo-ojiisa has quit [Read error: Connection reset by peer]

15:14 IlPalazzo-ojiisa has joined #rust-embedded

15:16 hifi has left #rust-embedded [#rust-embedded]

15:24 AdrianGeipert[m] has quit [Quit: Idle timeout reached: 172800s]

15:30 <Noah[m]1> <adamgreig[m]> "Depends why it needs padding..." <- yeah but here we need to basically fulfil the CMSIS-DAP standard :)

15:30 <Noah[m]1> So I cannot just model it however I like :)

15:55 <Noah[m]1> what the duck, copilot can autocomplete my complete macro syntax that I just wrote ...

15:56 danielb[m] has joined #rust-embedded

15:56 <danielb[m]> sponsored by Big Macro?

15:59 <danielb[m]> copilot is usually good enough with figuring out regularities so I guess this is some nice extra

16:18 Guest7282 has left #rust-embedded [Error from remote client]

16:55 PeterHansen[m] has joined #rust-embedded

16:55 <PeterHansen[m]> Should this output be possible (specifically the PC value being somewhere in RAM)?... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/gJvbVkAaVfLvkLLkZHWySIAE>)

16:55 <PeterHansen[m]> * Should this output be possible (specifically the PC value being somewhere in RAM)?... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/rKpusqCtvIOGbiORKnmvxphv>)

16:55 <PeterHansen[m]> The handler is just... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/uBElRPdhfRgDlYljbdFiWTFj>)

16:58 <PeterHansen[m]> The LR (which I understand is the return address) is in this code immediately following the memcpy call. Is it possible memcpy does crazy magic that runs code from RAM?... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/AWArUnslWcNPeFqAVPDhLuiC>)

16:59 <PeterHansen[m]> * The LR (which I understand is the return address) points, I guess, to the 27c56 line immediately following the memcpy call. Is it possible memcpy does crazy magic that runs code from RAM?... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/zVOVfmsMHiQQBnKvpWzONWWR>)

17:36 <thejpster[m]> Seems unlikely. Does objdump show any functions in RAM? Anything that matches that address? Which target are you using?

18:06 AdamHott[m] has joined #rust-embedded

18:06 <AdamHott[m]> Hello, does anyone have a reliable source that distributes FROM the EU that has any of these KittenBot Robotbit Robotics expansion board for micro:bit? https://www.kittenbot.cc/products/robotbit-robotics-expansion-board-for-micro-bit

18:35 Guest7282 has joined #rust-embedded

19:22 <PeterHansen[m]> <thejpster[m]> "Seems unlikely. Does objdump..." <- There are no functions there. I just checked and it's actually in the area allocated to HEAP. Target is thumbv7em-none-eabihf. Is it safe to say that one would expect that PC could *never* be in RAM in any normal code?

19:22 <diondokter[m]> PeterHansen[m]: It could, but it's not common

19:23 <JamesMunns[m]> it does seem unlikely. AFAIK nothing by default does that.

19:23 <JamesMunns[m]> If you did have some kind of stack or other data corruption, it's possible you corrupted something else tho

19:24 <JamesMunns[m]> like, if you are doing some UB somewhere else, and it corrupted some value, you could end up in the wrong place. Do you have a small example that reproduces this just by printing the None like you mentioned earlier?

19:25 <JamesMunns[m]> and if so: could you post that full setup somewhere? like a GH repo other people could compile and debug? you're certainly seeing something very weird, outside of something we could probably wildly guess at

19:25 <PeterHansen[m]> Note that the hardfault occurs only when I do anything that tries to format! an Option that is a None. Not a Some. And by checking the stack a bit it seems that it is indeed occurring in the midst of it attempting to format an Option, and looks like it's trying to grow a Vec in the middle of that. It's nearly certain this is nothing I'm doing wrong... but I'm still trying to reduce this to code that someone else could run

19:25 <PeterHansen[m]> that reproduces this. It's one of those problems where just changing the layout of the code by removing stuff seems to affect it.

19:26 <JamesMunns[m]> PeterHansen[m]: are you maybe overflowing your stack while growing the vec?

19:26 <PeterHansen[m]> Definitely not.

19:26 <JamesMunns[m]> or, is your heap misconfigured and corrupting your stack?

19:26 <diondokter[m]> It's on the heap, right? Could you increase the heap size?

19:27 <PeterHansen[m]> At this point I'm doing little more than initializing Embassy, spawning two tasks that both then sleep forever, and doing these formats. Unfortunately if I try to strip all the other code out, even though it's not being called, it can make the problem vanish, so I'm still working on that reduction.

19:27 <JamesMunns[m]> i'd be interested in how you setup your heap, and what your linker script looks like

19:27 <PeterHansen[m]> HEAP is 128K, but this is the first and only thing being allocated now.

19:27 <diondokter[m]> PeterHansen[m]: Ah, then the size not an issue

19:27 <PeterHansen[m]> HEAP setup is the generic embedded-alloc setup.

19:28 <PeterHansen[m]> I'll continue working to reduce this (not my first rodeo), but I did just want confirmation that a PC in RAM is simply wrong... i.e. there's no chance that a compiler optimization, or memcpy implementation could be running self-generated code in RAM.

19:28 <JamesMunns[m]> So you're doing the exact thing in this example?

19:28 <JamesMunns[m]> https://docs.rs/embedded-alloc/latest/embedded_alloc/#example

19:28 <PeterHansen[m]> In this case, now that I've noticed the address is in my heap, I think it's safe to say that's simply not possible.

19:29 merFurkanDemirci has quit [Quit: Idle timeout reached: 172800s]

19:29 <PeterHansen[m]> Yes, identical to that, modulo the fact I have the code spread across more modules and lots of irrelevant stuff still being compiled and linked in, even though it's no longer called.

19:30 Guest7282 has left #rust-embedded [Error from remote client]

19:30 <diondokter[m]> Is it always the same PC address?

19:30 <diondokter[m]> If so, maybe you can find the jump by using a conditional watchpoint on the PC in GPB.

19:30 <diondokter[m]> This is super mega awfully slow, but it might help

19:30 <diondokter[m]> s/GPB/GDB/

19:31 <JamesMunns[m]> > there's no chance that a compiler optimization, or memcpy implementation could be running self-generated code in RAM

19:31 <JamesMunns[m]> I have never heard of Rust doing that ever, no. If you are running code in Ram, you'd know (and have to set it up yourself)

19:31 <PeterHansen[m]> I had originally mentioned this in this channel several months ago, before I'd discovered it was a hard fault. At the time I meant to post in the embassy channel because I thought it likely it was related to that somehow. Then yesterday I posted in the embassy channel asking for helping analyzing hard faults, but shortly after I realized this is likely nothing to do with embassy (when I narrowed it down to Option::None debug

19:31 <PeterHansen[m]> formatting). What I originally found when I first posted is the same now though: this fails with 2023-08-09 or later, but works fine with 2023-08-08 or earlier. And always the same address, same stack trace.

19:31 <diondokter[m]> diondokter[m]: > <@diondokter:matrix.org> Is it always the same PC address?... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/tsTMYlhEildlYbyQQVUOzbGs>)

19:32 <PeterHansen[m]> diondokter: I could probably do that, but I have no GDB-fu... never used it.

19:32 <diondokter[m]> PeterHansen[m]: Mine is bad too, but with a bit of google-fu it's doable

19:32 <JamesMunns[m]> Formatting is somewhat interesting in that it uses dynamic dispatch. It could be a miscompilation of that. The code will still be in flash, but the vtable will be in RAM.

19:33 <PeterHansen[m]> I got as far yesterday as running probe-rs gdb, then running gdb and trying some attach statement that appeared to connect to the probe-rs process, but at that point I had no idea what to do next... so in theory if that would be the best way to proceed, with a little guidance I could do it.

19:34 <PeterHansen[m]> That said, when I first brought this up someone suggested a rustc bisect procedure. At the time it was infeasible for me since reproducing this required an awkward procedure, but now that I know it's a hard fault I can get instant positive indication of the failure so bisect is an option. Especially if I don't have to automate it... I assume I can give manual input to the procedure.

19:35 <PeterHansen[m]> James Munns: But the address is in my HEAP, 4 bytes after the start. I think the vtable would be in data somewhere, not heap.

19:35 <JamesMunns[m]> yeah, I mean something has gone terribly wrong

19:35 <JamesMunns[m]> the vtable itself wouldn't end up in the LR

19:37 <PeterHansen[m]> Does cargo bisect-rustc require using a script, or can you give manual input? If script, I guess I just need to make a dummy script that waits for me to enter a response, while I take the generated code and download elsewhere (different host... compile on fast laptop, deploy from raspberry pi with SWD hooked up).

19:39 <PeterHansen[m]> Unfortunately I have a family matter to deal with until Tuesday so I'll need to park this very shortly, but I appreciate all suggestions.

19:40 <PeterHansen[m]> (For more context, in addition to embassy-nrf, I still have nrf-softdevice linked in and am using its critical-section impl, but because I have almost no real code running it is not initializing the SoftDevice or using any peripherals, or really anything except some embassy_time timers, and rtt-target with an up and down channel initialized (but only output being used actively).)

19:41 <JamesMunns[m]> I think it's reasonable to bisect rustc (not sure on your question), but also reasonable to ensure your code is not doing any UB, or something that would otherwise corrupt the running state of your processor.

19:42 <PeterHansen[m]> Pretty sure I have no UB or unsafe code being actively called here, though some could theoretically still be in what's linked but not executed.

19:42 <diondokter[m]> Yeah, embassy might. It's very good, but not always perfect ;)

19:42 <PeterHansen[m]> of my own that is... there are still lots of outside crates being linked in too, though again very little code being executed now.

19:43 <diondokter[m]> Maybe compare the generated assembly yourself between the two compiler versions?

19:43 <PeterHansen[m]> also using opt_level "z". I tried with all others and none fail except "s", so it could be related to s/z optimization.

19:43 <PeterHansen[m]> diondokter: umm. very good thought. Wish I'd had it. :)

19:45 <PeterHansen[m]> For anyone who knows more about hard faults, note that what I posted in the embassy channel mentioned the output of two hard-fault related registers:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/fAjXrskDuqQlPUjwGNXGXsgM>)

19:48 Guest7282 has joined #rust-embedded

19:51 <PeterHansen[m]> diondokter: There are very significant differences in the compiler builtin aeabi_memset routine, which is the one that appears to be causing the hard fault... interesting.

19:51 <diondokter[m]> Huh

19:51 <JamesMunns[m]> what's the date on the first broken nightly?

19:53 <PeterHansen[m]> It's rustc 1.73.0-nightly (f88a8b71c 2023-08-08)

19:53 <PeterHansen[m]> The one before that works is rustc 1.73.0-nightly (03a119b0b 2023-08-07)

19:54 therealprof[m] has quit [Quit: Idle timeout reached: 172800s]

19:55 <PeterHansen[m]> (Caveat re my comment above about optimization levels: I don't really believe it's related to that... I think it's just that those rearrange the code in such a way that it can cause/remove the failure, just as removing some of my code that isn't even being called can remove the failure... Link order etc shifting things around in memory perhaps.)

19:57 <PeterHansen[m]> Anyone have caveats about running rustc-bisect in a WSL2 (Windows System for Linux) debian host? I've never built rustc or anything like that before...

19:59 <JamesMunns[m]> We did switch from LLVM16 to LLVM17 on Aug 7th: https://github.com/rust-lang/rust/commit/8c1c7d37b29d72bad1f218798d121074918e9616

20:00 <diondokter[m]> Interesting. Also, my guess is that aeabi_memset is something LLVM knows, not something rustc knows

20:01 <JamesMunns[m]> https://github.com/llvm/llvm-project/issues/69629

20:01 <PeterHansen[m]> James Munns: Yes, when I first brought this up several months ago I had identified there was a major change there. I noticed it because my code size had also changed relatively significantly between the two nightlies.... like 1-2K on a 200K code base, which I found a tad surprising.

20:01 <JamesMunns[m]> I wonder if something the memcpy is pulling from is not aligned

20:01 IlPalazzo-ojiisa has quit [Quit: Leaving.]

20:02 <JamesMunns[m]> you'd get a different unaligned fault, but if that was unhandled it might escalate to a hardfault?

20:02 <JamesMunns[m]> you probably don't have a specific unaligned fault handler

20:02 <PeterHansen[m]> James Munns: The hard fault register output seemed to me to imply it wasn't an alignment issue, since I think there's a different bit set for that than INVSTATE. The page I was looking at suggested INVSTATE was only (or primarily?) when the low bit of the address wasn't set or was set incorrectly for a Thumb instruction or something like that.

20:02 <diondokter[m]> Might fall under the memfault exception?

20:02 <diondokter[m]> You might be able to enable stuff with that

20:02 <JamesMunns[m]> Can you try adding a handler for UsageFault specifically?

20:03 <diondokter[m]> Or maybe that one yeah

20:03 <PeterHansen[m]> I was going based on https://interrupt.memfault.com/blog/cortex-m-hardfault-debug

20:03 <JamesMunns[m]> like... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/RfaQpaEmOVqwMfRlGjmURFWy>)

20:04 <JamesMunns[m]> s///, s/irqn: i16//

20:10 <adamgreig[m]> invstate might make sense if somehow pc was loaded with a ram address that didn't have lsb set

20:10 <adamgreig[m]> like someone really messed up a branch

20:10 <PeterHansen[m]> James Munns: If I do that with the generic HardFault handler in place it gives the same failure as before. I can try removing that and leaving just yours.

20:10 <adamgreig[m]> you want to have the UsageFault and you also need to enable usagefault not escalating to hardfault iirc

20:11 <PeterHansen[m]> Yeah, with just that UsageFault but not enabling anything it goes to HardFault still.

20:12 <PeterHansen[m]> Is INVSTATE a configurable UsageFault? I had the impression only alignment and another were configurable.

20:12 <PeterHansen[m]> or is the idea that an alignment issue may not immediately cause the failure, so enabling the usage fault may show a more localized source for the failure?

20:13 <adamgreig[m]> I don't think enabling usagefault handling will change anything significant, right now if it's generating a usagefault due to invstate it's just immediately escalating to the hardfault handler

20:13 <adamgreig[m]> if the fault status register says it was an escalated usagefault, you can still read the usagefault bits out of cfsr I believe

20:14 <JamesMunns[m]> yeah, my hope was to see if it was some other fault that was being escalated to a hardfault

20:15 <PeterHansen[m]> I would have thought that would still leave the other bit set in the CFSR. This has only the one bit.

20:16 <adamgreig[m]> yea, CFSR has the INVSTATE bit set then it was a UsageFault, and HFSR has FORCED set then it escalated usagefault to hardfault since usagefault handling wasn't enabled

20:16 <JamesMunns[m]> I'm certainly not an expert! I'd trust Adam's knowledge over mine :D

20:17 <adamgreig[m]> if you set USGFAULTENA in SHCSR I'd expect it to start calling the UsageFault handler

20:18 <adamgreig[m]> but I don't think that would tell you anything new, the question is why did pc get loaded with an address from sram that happened to not have the LSb set? maybe?

20:18 <adamgreig[m]> does that actually trigger INVSTATE or does only BX style instructions do it though...

20:18 <JamesMunns[m]> ah, but I see what you are saying, the UNALIGNED flag isn;'t set but the INVSTATE flag is

20:18 <JamesMunns[m]> so even if we get a specific usagefault handler, that doesn't change the fact we didn't get triggered by an UNALIGNED read

20:19 <PeterHansen[m]> I seem to be missing something about how rustc-bisect works. If I just run it with the two known bounds `cargo bisect-rustc --start=2023-08-08 --end=2023-08-09` then it just outputs this and bails:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/QevlQaZFzJUXQgcEREwtFmpD>)

20:19 <PeterHansen[m]> I guess I have to have it do a build then somehow wait for feedback while I install it and run it.

20:19 <adamgreig[m]> if you've already bisected it to a specific nightly I don't know how much you'll learn from rustc-bisect

20:20 <adamgreig[m]> especially when that nightly includes a "bump llvm version" commit :p

20:20 <PeterHansen[m]> It says if its within the last 167 nightlies it will automatically bisect on the PRs.

20:21 <adamgreig[m]> could you see what's in memory at that 0x20005044 address?

20:21 <PeterHansen[m]> adamgreig: It's just 4 bytes into my HEAP, presumably the first thing allocated.

20:21 <PeterHansen[m]> I assume as soon as I format!() it's beginning to build a string in allocated space.

20:21 <adamgreig[m]> wonder what instruction it disassembles into

20:22 <PeterHansen[m]> ah

20:22 <adamgreig[m]> but it does sorta depend what else is going wrong and how pc was loaded

20:22 <adamgreig[m]> the other thing maybe worth doing is putting a breakpoint just before things go bad and single-stepping instruction by instruction until the fault

20:22 <PeterHansen[m]> would it be that location, or one word before or after perhaps?

20:22 <adamgreig[m]> it should be that location

20:23 <adamgreig[m]> the 16 or 32 bits starting there, anyway

20:24 <adamgreig[m]> but yea, if you can get a gdb session going, i'd be trying to single step into the failure, then you'll see exactly what code loaded a ram address into pc and can hopefully work backwards a bit

20:24 <PeterHansen[m]> adamgreig: Here's the words before, at, and after that location:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/rPnMGeMTjyGsUYpezsEikPun>)

20:25 <PeterHansen[m]> Those are different locations from before, but 52bc is equivalent to the old one... HEAP is at 52b8.

20:25 explodingwaffle1 has joined #rust-embedded

20:25 <explodingwaffle1> <PeterHansen[m]> "I seem to be missing something..." <- > <@peter9477:matrix.org> I seem to be missing something about how rustc-bisect works. If I just run it with the two known bounds `cargo bisect-rustc --start=2023-08-08 --end=2023-08-09` then it just outputs this and bails:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/BoUefXvZfekhkhDSSpluxhkD>)

20:26 <PeterHansen[m]> So that first word is "None"... as expected I guess.

20:27 <PeterHansen[m]> For reference, this was the trace this time:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/xIQjFfwyGtTIstOKcWDDJLFi>)

20:28 <PeterHansen[m]> with the LR pointing into here:... (full message at <https://catircservices.org/_matrix/media/v3/download/catircservices.org/oOkAvunHYVxvxHKOzLXfLbtO>)

20:28 <PeterHansen[m]> So just after the memcpy as before.

20:29 <PeterHansen[m]> I suppose dumping regs and analyzing the memcpy code could give a hint what happened, but at that point I suppose single-stepping to it may make more sense. I'd need to learn some gdb then...

20:29 <diondokter[m]> Hmmm, probably best to get the debugger, set a breakpoint on that bl instruction and then step into it and see what it does

20:30 <diondokter[m]> Works in vscode with the cortex-debug plugin as well

20:30 <diondokter[m]> Don't have to use GDB cli

20:30 <PeterHansen[m]> Anyway, very unfortunately I really have to bail for today and tomorrow for a family matter. Much thanks for the input.

20:30 <dirbaio[m]> do you have a self-contained repro?

20:32 <PeterHansen[m]> Not yet. As mentioned I can so far only get it to fail by including a fair chunk of code, even though I've carefully neutered all of it. I have to spawn() two different tasks at the moment, and even though each has a "sleep forever" at the start to prevent anything from running, I still need to include their actual code, and much of what they would call, in order to trigger the failure.

20:33 <PeterHansen[m]> So taking out the spawns, or removing the code following the sleep-forever, makes it go away. That's why I assume it's related to link order etc rather than my actual code for the most part.

20:33 <dirbaio[m]> no unsafe?

20:33 <PeterHansen[m]> I tried starting fresh with a cortex-m-rt-quickstart yesterday but that didn't fail. Will need to continue trying to strip down my own stuff apparently.

20:34 <dirbaio[m]> have you checked whether your stack is overflowing?

20:34 <PeterHansen[m]> No unsafe. Literally almost nothing left except (a) initialize HEAP and RTT and embassy, (b) spawn two tasks that just sleep, and (c) format! a None.

20:34 <PeterHansen[m]> Stack usage is probably about 200 bytes or something... no chance I'm dealing with stack or RAM issues like that.

20:35 <PeterHansen[m]> MSP : 0x2003f840

20:35 <dirbaio[m]> oh wel

20:35 <dirbaio[m]> * oh well

20:36 <PeterHansen[m]> My top of stack is 0x2003FC00

20:36 <dirbaio[m]> if you post a repro I'll take a look

20:36 Orange_Murker[m] has quit [Quit: Idle timeout reached: 172800s]

20:37 <PeterHansen[m]> Thanks. Okay, gotta bail. Not back at this until tomorrow night at earlier. :(

20:39 <PeterHansen[m]> s/earlier/earliest/

21:18 Tyalie7098[m] has quit [Quit: Idle timeout reached: 172800s]

21:39 firefrommoonligh has quit [Quit: Idle timeout reached: 172800s]

23:21 crabbedhaloablut has quit []

23:24 Guest7282 has left #rust-embedded [Error from remote client]