#rust-embedded on 2023-02-11 — irc logs at libera.irclog.whitequark.org

2022-02-07 19:20 ChanServ changed the topic of #rust-embedded to: Welcome to the Rust Embedded IRC channel! Bridged to #rust-embedded:matrix.org and logged at https://libera.irclog.whitequark.org/rust-embedded, code of conduct at https://www.rust-lang.org/conduct.html

00:22 <re_irc> <thejpster> Sorry, I've been fiddling again. I drew ISO and ANSI keyboards as ASCII art because I was still getting confused about the symbolic keycap naming: https://github.com/rust-embedded-community/pc-keyboard/tree/moredocs#keycodes

00:23 <re_irc> <thejpster> Turns out the site I was using as a reference used _different_ symbolic names for the _same_ scancodes depending on whether it was ANSI or ISO.

00:23 <re_irc> <thejpster> I can't be doing with that.

00:24 <re_irc> <thejpster> I promise I'll really try and get 0.7.0 shipped after this round.

01:19 <re_irc> < (@datdenkikniet:matrix.org)> https://github.com/rust-embedded/cortex-m/pull/463

01:21 <re_irc> < (@adamgreig:matrix.org)> how did you find that, ?

01:21 <re_irc> < (@datdenkikniet:matrix.org)> See #embassy-rs:matrix.org (https://matrix.to/#/#embassy-rs:matrix.org) :D

01:22 <re_irc> < (@adamgreig:matrix.org)> well, that must have been fun

01:22 <re_irc> < (@adamgreig:matrix.org)> stupid "do weird non-platform-standard things in init code to appease buggy debuggers" strikes again

01:23 <re_irc> < (@dirbaio:matrix.org)> the issue appeared in a nightly bump, so we spent a looooot of time thinking it was a miscompile 😂

01:24 <re_irc> < (@datdenkikniet:matrix.org)> +(We had a bit of a journey over there)

01:27 <re_irc> < (@dirbaio:matrix.org)> while all it did was change codegen enough to trigger a "strd" :P

01:27 <re_irc> < (@dirbaio:matrix.org)> +on stack

01:29 <re_irc> < (@adamgreig:matrix.org)> I guess it's been broken since 2021-11-22, so c-m-rt 0.7.0 should have been ok but not 0.7.1? still looking through embassy scrollback though, hadn't been up to date on it for a few days obviously

01:29 <re_irc> < (@adamgreig:matrix.org)> https://github.com/rust-embedded/cortex-m/commit/8bf70f59bd7dc12660ab033c54520addd4368361

01:29 <re_irc> < (@adamgreig:matrix.org)> ah yes, literally just got to the line about it being OK on 0.7.0, lol

01:29 <re_irc> < (@dirbaio:matrix.org)> yeah, 0.7.0 works, 0.7.[12] fails

01:32 <re_irc> < (@dirbaio:matrix.org)> i'm VERY surprised this went unnoticed for so long :O

01:34 <re_irc> < (@adamgreig:matrix.org)> yea, did you establish if new nightly codegen just hits it more often than before?

01:35 <re_irc> < (@dirbaio:matrix.org)> I don't think so, it's probably just bad luck

01:35 <re_irc> < (@dirbaio:matrix.org)> in Peter Hansen's code the original bug manifested when bumping nightly-2023-02-04 -> nightly-2023-02-05

01:35 <re_irc> <Peter Hansen> : Something changed in 2023-02-05 affected some optimization relating to having a local 2-element array in a function, and picking an index into it with something that amounts to a boolean.

01:36 <re_irc> < (@dirbaio:matrix.org)> but the other repros reproduce up to nightlies from 2021

01:36 <re_irc> < (@adamgreig:matrix.org)> what target were you building for?

01:36 <re_irc> <Peter Hansen> It was an extremely roundabout path (and hours of work) to get back to the real cause.... thumbv7em-none-eabihf

01:36 <re_irc> < (@dirbaio:matrix.org)> reproduces with all thumbs

01:36 <re_irc> < (@dirbaio:matrix.org)> (didn't test further back than 2021 because lack of "global_asm!" support breaks the build)

01:37 <re_irc> < (@adamgreig:matrix.org)> ah, though 0.7.1 didn't have global_asm and did have the bug, so I guess would have included it

01:38 <re_irc> < (@adamgreig:matrix.org)> was curious if it was a v6 vs v7 thing, as v6 requires 8 byte aligned SP on function entry but v7 doesn't

01:39 <re_irc> < (@dirbaio:matrix.org)> v7 doesn't?

01:39 <re_irc> < (@adamgreig:matrix.org)> but it's plausible that LLVM requires it regardless

01:39 <re_irc> < (@adamgreig:matrix.org)> v7 requires 4 byte SP alignment, but if LLVM requires 8 byte SP we should be setting STKALIGN in CCR to tell the CPU to make it always 8 bytes on interrupt entry too

01:40 <re_irc> < (@adamgreig:matrix.org)> (arm recommend 8 byte alignment anyway)

01:40 <re_irc> < (@jamesmunns:beeper.com)> yo, that's one hell of a bug. Good sleuthing!

01:41 <re_irc> < (@adamgreig:matrix.org)> more tempted to delete the whole thing and tell debuggers to stop being stupid dum-dums, but alas, we fix what we can fix and take the rest with grace I guess

01:41 <re_irc> < (@adamgreig:matrix.org)> same deal with having to manually set the SP a few clock cycles after the CPU does it for us -_-

01:41 <re_irc> < (@jamesmunns:beeper.com)> (is there a citation for "LLVM/ARM requires 8 byte alignment"?)

01:44 <re_irc> < (@dirbaio:matrix.org)> https://developer.arm.com/documentation/ddi0403/d/System-Level-Architecture/System-Level-Programmers--Model/ARMv7-M-exception-model/Stack-alignment-on-exception-entry

01:44 <re_irc> < (@dirbaio:matrix.org)> "some software standards require the stack pointer to be 8-byte aligned"

01:44 <re_irc> < (@dirbaio:matrix.org)> "ARM deprecates implementation or use of 4-byte SP alignment."

01:44 <re_irc> < (@dirbaio:matrix.org)> "the AAPCS requires 8-byte stack pointer alignment on entry to a conforming function"

01:44 <re_irc> < (@adamgreig:matrix.org)> https://github.com/ARM-software/abi-aa/blob/main/aapcs32/aapcs32.rst#6211universal-stack-constraints is the AAPCS reference

01:45 <re_irc> < (@adamgreig:matrix.org)> which is "always 4 byte aligned, and 8 byte aligned on entry to an AAPCS function"

01:45 <re_irc> < (@dirbaio:matrix.org)> https://github.com/ARM-software/abi-aa/blob/main/aapcs32/aapcs32.rst#6212stack-constraints-at-a-public-interface

01:45 <re_irc> < (@dirbaio:matrix.org)> ah yep that does it

01:46 <re_irc> < (@adamgreig:matrix.org)> v7-M guarantees 4-byte alignment but you can set STKALIGN to say it should be 8 byte, though I imagine many implementations make this RAO anyway

01:46 <re_irc> < (@adamgreig:matrix.org)> with -hf target the FP state preservation takes precedence over stkalign too, so

01:47 <re_irc> < (@adamgreig:matrix.org)> I wonder if anything out there doesn't reset it to 1...

01:47 <re_irc> < (@dirbaio:matrix.org)> hopefully not 😰

01:51 <re_irc> < (@jamesmunns:beeper.com)> whew

01:53 <re_irc> < (@adamgreig:matrix.org)> do any other init routines push lr to the stack? I haven't immediately found anyone else doing it but presumably it came from somewhere

01:54 <re_irc> < (@datdenkikniet:matrix.org)> https://github.com/zephyrproject-rtos/zephyr/blob/main/arch/arm/core/aarch32/isr_wrapper.S

01:54 <re_irc> < (@datdenkikniet:matrix.org)> Zephyr seems to, at least

01:55 <re_irc> < (@datdenkikniet:matrix.org)> Oh but that's some ISR thing, so not 100% sure that it applies here

01:56 dne has quit [Remote host closed the connection]

01:56 <re_irc> < (@datdenkikniet:matrix.org)> They do also seem to align it to 8 bytes, though

01:57 dne has joined #rust-embedded

01:58 <re_irc> < (@dirbaio:matrix.org)> there's a related footgun

01:58 <re_irc> < (@dirbaio:matrix.org)> if you set RAM size in memory.x to something not multiple of 8

01:59 <re_irc> < (@dirbaio:matrix.org)> then stack will also be unaligned

02:00 <re_irc> < (@adamgreig:matrix.org)> chibios doesn't: https://github.com/ChibiOS/ChibiOS/blob/master/os/common/startup/ARMCMx/compilers/GCC/crt0_v7m.S#L360

02:00 <re_irc> tockos doesn't: https://github.com/tock/tock/blob/master/arch/cortex-m/src/lib.rs#L360

02:00 <re_irc> zephyr doesn't: https://github.com/zephyrproject-rtos/zephyr/blob/main/arch/arm/core/aarch32/cortex_m/reset.S#L169

02:05 <re_irc> < (@adamgreig:matrix.org)> libopencm3 has a brave "wrote r0 in C", doesn't push LR to the stack, but _does_ set STKALIGN in CCR: https://github.com/libopencm3/libopencm3/blob/8bc483746bd78f2a398f2949420a4128eed5272c/lib/cm3/vector.c#L62

02:06 <re_irc> < (@adamgreig:matrix.org)> "/* Enabled by default on most Cortex-M parts, but not M3 r1 */" 😱

02:08 <re_irc> < (@adamgreig:matrix.org)> ST's HAL (at least for F1, it's a million startup.s files for every part number of course) doesn't push LR, doesn't seem to set STKALIGN (I guess they already know if their parts have it hardcoded) https://github.com/STMicroelectronics/STM32CubeF1/blob/master/Drivers/CMSIS/Device/ST/STM32F1xx/Source/Templates/gcc/startup_stm32f100xb.s#L100

02:11 <re_irc> < (@adamgreig:matrix.org)> this was the original PR in c-m-rt, suggesting it was added to fix a false positive stack overflow in probe-run? https://github.com/rust-embedded/cortex-m-rt/pull/337

02:14 <re_irc> < (@adamgreig:matrix.org)> but presumably people don't have their debuggers unwind past reset in all those other startup files, so I wonder what's going on there

02:14 <re_irc> < (@adamgreig:matrix.org)> if we can avoid wasting 8 bytes of everyone's stack forever that would be nice

02:17 <re_irc> < (@adamgreig:matrix.org)> fair enough that "bl main" at the end of the reset handler will put some PC value from the middle of the reset handler into LR, which indeed overwrites the FFFFFFFF we put into it manually

02:18 <re_irc> < (@dirbaio:matrix.org)> why not "b main" instead?

02:19 <re_irc> < (@9names:matrix.org)> what would that look like when unwinding?

02:20 <re_irc> < (@dirbaio:matrix.org)> that means we'd be relying in the fact the macro enforces main is "-> !"

02:20 <re_irc> < (@dirbaio:matrix.org)> but I think that's fine?

02:20 <re_irc> < (@dirbaio:matrix.org)> unwind would look like: main, then nothing? hopefully

02:20 <re_irc> < (@adamgreig:matrix.org)> the reason for "bl" is that "b" has a very limited jump range

02:21 <re_irc> < (@adamgreig:matrix.org)> 1MB for b, 16MB for bl

02:21 <re_irc> < (@dirbaio:matrix.org)> then ldr+bx?

02:21 <re_irc> < (@adamgreig:matrix.org)> (that's why all those startup files above use bl too)

02:21 <re_irc> < (@adamgreig:matrix.org)> hmm

02:22 <re_irc> < (@dirbaio:matrix.org)> but

02:22 <re_irc> < (@dirbaio:matrix.org)> if everyone uses "bl" and doesn't push LR... we're the odd ones

02:22 <re_irc> < (@adamgreig:matrix.org)> yep

02:22 <re_irc> < (@adamgreig:matrix.org)> I mean, it seems totally reasonable for lr to indicate that reset called main

02:23 <re_irc> < (@adamgreig:matrix.org)> how does the debugger know to stop unwinding when it gets to a stack frame inside reset, though?

02:23 <re_irc> < (@dirbaio:matrix.org)> debuggers can already know main -> reset, that's the LR in main's stack frame

02:24 <re_irc> < (@dirbaio:matrix.org)> then presumably they look at unwind info for reset

02:24 <re_irc> < (@dirbaio:matrix.org)> and see.. nothing?

02:25 <re_irc> < (@dirbaio:matrix.org)> perhaps the issue is that we _do_ have the cfi stuff and the " .type Reset,%function"?

02:26 <re_irc> < (@dirbaio:matrix.org)> all the ones you linked don't

02:26 <re_irc> < (@dirbaio:matrix.org)> so perhaps that's how the debugger knows "this is a random bag of ASM, I should stop trying to unwind"?

02:26 <re_irc> < (@dirbaio:matrix.org)> +not a function,

02:28 <re_irc> < (@adamgreig:matrix.org)> it wouldn't be the lr in main's stack frame if we used bx, though, right?

02:28 <re_irc> < (@dirbaio:matrix.org)> yeah, so that LR in main's frame is 0xFFFF_FFFF

02:29 <re_irc> < (@adamgreig:matrix.org)> all I meant was it seems fair enough to use bl and therefore enter main with an lr pointing back to reset

02:29 <re_irc> < (@dirbaio:matrix.org)> but then you wouldn't see Reset at all in the unwind, that's less nice

02:29 <re_irc> < (@dirbaio:matrix.org)> yeah

02:29 <re_irc> < (@adamgreig:matrix.org)> guess we'll need a quick reproducer setup for the unwinding issue then

02:30 <re_irc> < (@adamgreig:matrix.org)> does probe-run do its own unwinding... wonder when that one decides to stop

02:30 <re_irc> < (@adamgreig:matrix.org)> for the stack alignment check in the second pr, I haven't looked into it but I wonder if we can force a suitable alignment instead?

02:30 <re_irc> < (@adamgreig:matrix.org)> maybe it's fine to just error out but perhaps a more actionable error message would be useful in that case ("check ram size is a multiple of 8" on the second line or something maybe)

02:31 <re_irc> < (@dirbaio:matrix.org)> yeah I thought so too

02:31 <re_irc> < (@adamgreig:matrix.org)> : libopencm3's is written in C so I suppose it effectively does have that :P

02:31 <re_irc> < (@adamgreig:matrix.org)> and tock's is a rust naked function so again I expect should do

02:32 <re_irc> < (@dirbaio:matrix.org)> the libopencm3 one is cursed :P

02:32 <re_irc> < (@dirbaio:matrix.org)> adding asm to Reset to align the stack would waste bytes

02:32 <re_irc> < (@dirbaio:matrix.org)> and doing it in the linker script.. it'd need to align _down_ to 8 bytes, can you even do that?

02:33 <re_irc> < (@adamgreig:matrix.org)> just zero the last few bits?

02:34 <re_irc> < (@adamgreig:matrix.org)> given as it would just mean the last few bytes of ram can never be used anyway in that case I think it's OK to have it error instead

02:35 <re_irc> < (@adamgreig:matrix.org)> if naked functions had stabilised already we could have used them and dropped all the cfi stuff anyway, sigh

02:36 <re_irc> < (@dirbaio:matrix.org)> added more actionable error message :P

02:37 <re_irc> < (@adamgreig:matrix.org)> it's not completely clear to me but it seems like probe-run's unwinder does just wait to see 0xFFFFFFFF? https://github.com/knurling-rs/probe-run/blob/main/src/backtrace/unwind.rs#L91

02:37 <re_irc> < (@adamgreig:matrix.org)> wonder if gdb etc have the same problem

02:38 <re_irc> < (@dirbaio:matrix.org)> yea, but for that it needs reset to have proper cfi (?)

02:38 <re_irc> < (@adamgreig:matrix.org)> : thanks. hopefully anyone overriding _stack_start to some weird value will be able to work out what's up too lol

02:38 <re_irc> < (@dirbaio:matrix.org)> if it lands somewhere with no cfi it prints a scary "things might be corupted" message

02:39 <re_irc> < (@dirbaio:matrix.org)> : ah you can _override_ it

02:39 <re_irc> < (@adamgreig:matrix.org)> naturally 😆

02:39 <re_irc> < (@adamgreig:matrix.org)> https://docs.rs/cortex-m-rt/latest/cortex_m_rt/#_stack_start

02:39 <re_irc> < (@dirbaio:matrix.org)> ERROR(cortex-m-rt): stack start address is not 8-byte aligned.

02:39 <re_irc> By default, stack starts at the end of RAM. Check that both RAM origin and

02:39 <re_irc> length are set to multiples of 8, in the `memory.x` file.

02:40 <re_irc> < (@dirbaio:matrix.org)> better with "by default"

02:40 <re_irc> < (@adamgreig:matrix.org)> looks good, doesn't need the comma after 8 though

02:41 <re_irc> < (@dirbaio:matrix.org)> ERROR(cortex-m-rt): stack start address is not 8-byte aligned.

02:41 <re_irc> If you haven't, stack starts at the end of RAM by default. Check that both RAM

02:41 <re_irc> origin and length are set to multiples of 8 in the `memory.x` file.

02:41 <re_irc> If you have set _stack_start, check it's set to an address multiple of 8 bytes.

02:42 <re_irc> < (@adamgreig:matrix.org)> "set to a multiple of 8 bytes" or "set to an address which is a multiple of 8 bytes" maybe? sorry to nitpick

02:43 <re_irc> < (@dirbaio:matrix.org)> I wonder, if you add a second linker script (like defmt does), does that run "after" link.x? and if so, can it change _stack_start to something bad after the assert has already ran?

02:44 <re_irc> < (@adamgreig:matrix.org)> I think in that case " PROVIDE(_stack_start = ORIGIN(RAM) + LENGTH(RAM));" from our link.x would have already run

02:44 <re_irc> < (@adamgreig:matrix.org)> it's only overridable in the previously-included memory.x in that case?

02:45 <re_irc> < (@adamgreig:matrix.org)> I am not completely confident in this though

02:46 <re_irc> < (@adamgreig:matrix.org)> (and if overridden in memory.x, the assert would happen afterwards so still catch any upsets)

02:46 <re_irc> < (@jamesmunns:beeper.com)> maybe dumb choice - if it can be overridden, do an asm "mask lower three bits, immediately halt if != 0"?

02:46 <re_irc> < (@jamesmunns:beeper.com)> (at runtime start)

02:47 <re_irc> < (@jamesmunns:beeper.com)> (would increase code size like... two instructions?)

02:47 <re_irc> < (@dirbaio:matrix.org)> noooo!you're wasting 4-8 bytes! ☠️

02:47 <re_irc> < (@dirbaio:matrix.org)> * noooo! you're

02:47 <re_irc> < (@jamesmunns:beeper.com)> lmao

02:50 <re_irc> < (@adamgreig:matrix.org)> I'm very reluctant to add that sort of runtime overhead if we can possibly avoid it

02:50 <re_irc> <Peter Hansen> Technically one could have _stack_start be 8-aligned even if neither RAM origin nor length were multiples of 8, as long as their sum was... maybe anyone doing that should be shot though.

02:50 <re_irc> < (@adamgreig:matrix.org)> so long as it's 8 byte aligned it doesn't matter if ram origin and length aren't multiples of 8 anyway though

02:50 <re_irc> < (@adamgreig:matrix.org)> I'd much rather just put in the documentation that if you override _stack_start, it must be a multiple of 8

02:51 <re_irc> <Peter Hansen> : Yeah, that was in reference to the text in the ASSERT added.

02:51 <re_irc> < (@jamesmunns:beeper.com)> fair! I figured "one assert at startup won't hurt", but again, probably not unreasonable to "fix it in docs"

02:51 <re_irc> < (@jamesmunns:beeper.com)> might be fun to have a bunch of "pedantic asserts" in a feature flag we can ask people to turn on when spooky things happen

02:51 <re_irc> < (@jamesmunns:beeper.com)> (can you gate asm blocks on "#[cfg(debug_assertions)]"?)

02:52 <re_irc> < (@jamesmunns:beeper.com)> like checking all the alignment and whatever else preconditions otherwise only specified in docs.

02:52 <re_irc> <Peter Hansen> Maybe better to have the pedantic asserts enabled normally but turnable-off only with an explicit feature.

02:53 <re_irc> < (@adamgreig:matrix.org)> best to avoid "turn _off_ with feature"

02:53 <re_irc> < (@adamgreig:matrix.org)> but also don't want to say "checks only on debug builds" because they're often not usable on embedded anyway and/or would totally obscure the problem

02:53 <re_irc> < (@adamgreig:matrix.org)> a feature that turns on some extra startup checks, akin to the existing set-vtor and set-sp features, I could live with

02:53 <re_irc> < (@jamesmunns:beeper.com)> yeah, could be an explicit flag I guess.

02:54 <re_irc> < (@jamesmunns:beeper.com)> (and really - should be separate from debug assertions for the reasons you said, the more I think about it)

02:56 <re_irc> <Peter Hansen> I was thinking that if they're not merely pedantic, but basically mandatory unless you know what you're doing, then "#[cfg(not(feature = "i_assume_the_risk"))" would make some sense. Maybe some of the checks are merely pedantic, but one where your system is broken like this one, unless you know the implications and take steps to deal with it should be more forceful maybe.

02:56 <re_irc> < (@jamesmunns:beeper.com)> its more about "how cargo works" than "should this be on or off"

02:56 <re_irc> < (@jamesmunns:beeper.com)> ANY dep could set "lol_trust_me" as a feature flag, and the checks will be gone.

02:57 <re_irc> <Peter Hansen> I don't mean the debug assertions aspect, I agree there. I mean this stack_start 8-alignment... it's basically mandatory, except for someone who knows what they're doing, right?

02:57 <re_irc> < (@jamesmunns:beeper.com)> (this is why "disable" features are not recommended in Rust)

02:57 <re_irc> <Peter Hansen> but okay, i see your point

02:57 <re_irc> < (@jamesmunns:beeper.com)> all feature flags are basically "globally OR'd together".

02:58 <re_irc> <Peter Hansen> yeah, bummer :(

02:58 <re_irc> < (@jamesmunns:beeper.com)> Yeah, features are one of those things I wish was different, but sorta get how we ended up with the current impl.

02:59 <re_irc> < (@jamesmunns:beeper.com)> maybe "features2" some day.

03:01 <re_irc> < (@adamgreig:matrix.org)> well, boo, the second link script does just get glued in afterwards, and so if you did override _stack_start in an additional link script it won't trigger the assert

03:01 <re_irc> < (@adamgreig:matrix.org)> so that's a pain but it's also so many levels of cursed

03:01 <re_irc> < (@adamgreig:matrix.org)> ho hum

03:02 <re_irc> < (@dirbaio:matrix.org)> lol

03:02 <re_irc> < (@adamgreig:matrix.org)> however you _can_ put "LONG(_stack_start & 0xFFFFFFF8);" in link.x

03:03 <re_irc> < (@adamgreig:matrix.org)> which (I just tested) does indeed zero the bottom three bits

03:03 <re_irc> < (@adamgreig:matrix.org)> which will round _down_ "_stack_start" to the next lower multiple of 8, so seems ok?

03:03 <re_irc> < (@adamgreig:matrix.org)> needs testing on arm-none-eabi-gcc and arm-none-eabi-ld as well as rustlld though

03:04 <re_irc> < (@adamgreig:matrix.org)> I guess we could turn the error into a warning with that

03:05 <re_irc> < (@adamgreig:matrix.org)> (I don't know that warnings are a thing here though, so I'd probably just keep it as an error)

03:05 <re_irc> < (@dirbaio:matrix.org)> why'd you want to set a nonaligned stack though?

03:06 <re_irc> < (@adamgreig:matrix.org)> it's always a mistake in rust, so I think you wouldn't want that

03:06 <re_irc> < (@adamgreig:matrix.org)> (but the warning is because the linker script would force alignment)

03:06 <re_irc> < (@adamgreig:matrix.org)> (so it's no longer an error, just letting you know you're wasting 1-7 bytes of ram)

03:06 <re_irc> < (@dirbaio:matrix.org)> linker warnings aren't great, cargo doesn't show them unless there's also an error

03:06 <re_irc> < (@adamgreig:matrix.org)> yea

03:06 <re_irc> < (@adamgreig:matrix.org)> hence just keeping it as an error

03:06 <re_irc> < (@adamgreig:matrix.org)> but by masking it in the linker script, we do ensure that even overriding _stack_start in a separate file won't cause this problem

03:07 <re_irc> < (@dirbaio:matrix.org)> ahhh

03:07 <re_irc> < (@dirbaio:matrix.org)> so, the "LONG(_stack_start & 0xFFFFFFF8);" executes "after", even if it's earlier in the scripts than the override?

03:08 <re_irc> < (@dirbaio:matrix.org)> i've never fully understood linker scripts and never will lol

03:09 <re_irc> < (@adamgreig:matrix.org)> yea

03:10 <re_irc> < (@adamgreig:matrix.org)> I mean, if it executed "before" then we wouldn't have a problem because the second linker script couldn't override it

03:14 <re_irc> < (@adamgreig:matrix.org)> well that's https://github.com/rust-embedded/cortex-m/pull/465, let's see what the CI makes of it on the other linkers

03:15 <re_irc> < (@adamgreig:matrix.org)> seems fine locally at least

03:16 <re_irc> < (@adamgreig:matrix.org)> for the main problem, I propose finding out if we can just remove the push-lr-to-stack business entirely, by testing out behaviour of probe-run and gdb against not having it, not having it and also deleting the cfi func stuff, and some binaries for chibios/st hal/etc that don't include it either

03:17 <re_irc> < (@adamgreig:matrix.org)> it seems nicer and more in line with what "everyone else does", just with the caveat that somehow probe-run might need some other way to detect the end of the unwinding?

03:25 <re_irc> < (@jamesmunns:beeper.com)> Does stack_end need to be 8-aligned too?

03:25 <re_irc> < (@jamesmunns:beeper.com)> that'll require a slightly tricker "LONG((_stack_end & 0xFFFFFFF8) + 8);"

03:28 <re_irc> < (@jamesmunns:beeper.com)> I guess we don't ever actually produce a stack end symbol, it'll just be "the top of whatever else gets put in ram"

03:28 <re_irc> < (@jamesmunns:beeper.com)> I thought I saw something in that aapcs doc that talked about stack end, but if we hit it we're probably having a bad day anyway

03:29 <re_irc> < (@adamgreig:matrix.org)> stack _end_ isn't a real concept afaik, there is stklim on armv8 which is a separate really good cool thing but probably not material here

03:30 <re_irc> < (@adamgreig:matrix.org)> (psplim/msplim rather)

03:30 <re_irc> < (@adamgreig:matrix.org)> those registers have the last 3 bytes always 0 anyway

03:30 <re_irc> < (@jamesmunns:beeper.com)> yeah, looking again I can't find anything

03:30 <re_irc> < (@adamgreig:matrix.org)> * bits

03:30 <re_irc> < (@jamesmunns:beeper.com)> so... I should go to bed lol

03:31 <re_irc> < (@adamgreig:matrix.org)> same...

03:31 <re_irc> < (@adamgreig:matrix.org)> good work everyone involved in spotting this, what a subtle bug!

03:34 <re_irc> < (@adamgreig:matrix.org)> ARM have some startup code too but it's very basic https://github.com/ARM-software/CMSIS/blob/master/Device/ARM/ARMCM0/Source/GCC/startup_ARMCM0.S

03:34 <re_irc> < (@jamesmunns:beeper.com)> should probably be pretty noisy about this one, I'd guess?

03:34 <re_irc> < (@jamesmunns:beeper.com)> I don't totally grok when this could trigger, it seemed to require a pretty niche opt?

03:36 <re_irc> < (@jamesmunns:beeper.com)> (noisy once the next c-m-rt release happens, I mean)

03:36 <re_irc> < (@adamgreig:matrix.org)> I think strd/ldrd are the main culprits, but if there was a reproducer on v6 it can't have involved them

03:36 <re_irc> < (@adamgreig:matrix.org)> but yea, let's get it fixed first, then a new c-m-rt release and tweet

03:37 <re_irc> < (@adamgreig:matrix.org)> yea

03:37 <re_irc> < (@jamesmunns:beeper.com)> should just require a "cargo update -p cortex-m-rt" if you're already on cmrt 0.7?

03:37 <re_irc> < (@jamesmunns:beeper.com)> (I think 0.6 happened to be unaffected/was already 8 byte aligned somehow?)

03:40 <re_irc> < (@adamgreig:matrix.org)> 0.6 never included the code to push lr to stack, so yea, it's unaffected

03:40 <re_irc> < (@adamgreig:matrix.org)> 0.7.0 is OK too, only 0.7.1 and 0.7.2 are affected

03:40 <re_irc> < (@adamgreig:matrix.org)> (it's not that we need to take action to align the stack to 8 bytes, it starts out aligned, the problem is that we added an instruction in 0.7.1 that pushes 4 bytes to the stack which unaligned it)

03:41 <re_irc> < (@adamgreig:matrix.org)> it would be interesting to understand what else could be going wrong on v6 that triggers it, hmm

03:42 <re_irc> < (@adamgreig:matrix.org)> https://github.com/ARM-software/abi-aa/blob/main/advnote132/advnote132.rst#the-problem-and-how-to-avoid-it

03:42 <re_irc> < (@adamgreig:matrix.org)> ARM have a tech note about this problem lol

03:44 <re_irc> < (@adamgreig:matrix.org)> but, armv6m doesn't include ldrd/strd, and that doc notes that armv7m doesn't fault on misaligned strd/ldrd

03:44 <re_irc> < (@jamesmunns:beeper.com)> that note reads like LDRD/STRD exist in armv6? Is there just no thumb version?

03:44 <re_irc> < (@jamesmunns:beeper.com)> oh

03:45 <re_irc> < (@adamgreig:matrix.org)> there's a special note about cortex-m at the bottom

03:46 <re_irc> < (@adamgreig:matrix.org)> but it's really just about exceptions

03:46 <re_irc> < (@adamgreig:matrix.org)> so, I think if you were dealing with u128 on the stack it might easily trigger this, but I don't yet understand what was going wrong in the previous reproducers

03:47 <re_irc> < (@jamesmunns:beeper.com)> seemed to only repro with an array of size two, maybe it was using ldrd/strd to load those to registers faster?

03:49 <re_irc> < (@jamesmunns:beeper.com)> their repro case was using "[u32; 2]"

03:49 <re_irc> < (@jamesmunns:beeper.com)> so yeah, doing ldrd/strd for splatting those two from mem to registers makes sense?

03:52 <re_irc> < (@adamgreig:matrix.org)> yea, but ldrd is specifically meant to work fine with a word-aligned address, it doesn't need to be 8-byte aligned on armv7m

03:52 <re_irc> < (@adamgreig:matrix.org)> and it doesn't exist at all on armv6m which apparently also had problems

03:52 <re_irc> <Peter Hansen> I think datdenkikniet had identified an optimization that was applied only in the case of a two-element array, and could involve that strd instruction. The generated code was different before nightly 02-05 so it didn't trigger the optimization which was broken in the face of the non-8-aligned SP.

03:53 <re_irc> < (@adamgreig:matrix.org)> so I don't think the problem is actually the ldrd/strd

03:53 <re_irc> <Peter Hansen> But it got hard to follow the specifics since at some point the minimal example ended up no longer failing just with 02-05 but was failing with any Rust going back over a year.

03:55 <re_irc> <Peter Hansen> I'd just guess that the [u32; 2] situation tends to involve code that may be broken if SP is not 8-aligned. It was likely any of several different instructions involved depending on the specifics of the test code.

03:56 <re_irc> < (@adamgreig:matrix.org)> yea, but I wonder what exactly

03:56 <re_irc> <Peter Hansen> I actually never noticed an strd myself... not sure where picked that one... maybe in his own output.

03:56 <re_irc> < (@adamgreig:matrix.org)> out of interest did you have any u128 anywhere in the troublesome code?

03:56 <re_irc> <Peter Hansen> nope

03:57 <re_irc> < (@adamgreig:matrix.org)> it's not like [u32; 2] needs 8-byte alignment either

03:57 <re_irc> < (@adamgreig:matrix.org)> do you have a complete minimal reproducer anywhere?

03:57 <re_irc> < (@jamesmunns:beeper.com)> This was the bad repro I think?

03:57 <re_irc> 39b52: 9505 strr5, [sp, #20]

03:57 <re_irc> 39b50: 1f48 subsr0, r1, #5

03:57 <re_irc> ... long message truncated: https://psion.agg.io/_matrix/media/r0/download/psion.agg.io/QuVKadl2KSOmAZ4V9XzxQWFR3vtkTsTR (18 lines)

03:57 <re_irc> ; let modulator_id = match data_marker {

03:57 <re_irc> <Peter Hansen> no, but clearly the code that was being generated only worked with it.

03:57 <re_irc> <Peter Hansen> : https://github.com/peter9477/test2

03:57 <re_irc> < (@adamgreig:matrix.org)> thanks

03:57 <re_irc> < (@jamesmunns:beeper.com)> : if the compiler "knew" the stack was 8 byte aligned on entry, and wanted to do an 8 byte aligned trick with it, it could assume it was free to?

03:58 <re_irc> < (@adamgreig:matrix.org)> yes, that's right

03:58 <re_irc> < (@adamgreig:matrix.org)> but what trick is it doing?

03:58 <re_irc> < (@adamgreig:matrix.org)> the assumption earlier this evening was that as soon as we hit the strd instruction you get UB if it's not 8-byte aligned, but that's not true, so it must be something else

03:58 <re_irc> < (@adamgreig:matrix.org)> (plus that one you just posted doesn't have any strd etc)

04:00 <re_irc> < (@jamesmunns:beeper.com)> yeah, worth re-examining

04:08 <re_irc> < (@jamesmunns:beeper.com)> full dis asm: https://gist.github.com/jamesmunns/3313199fee70c012fb82a1ff361ee8c9

04:08 <re_irc> < (@jamesmunns:beeper.com)> body: https://gist.github.com/jamesmunns/3313199fee70c012fb82a1ff361ee8c9#file-cmrt-repro-txt-L219-L230

04:09 <re_irc> < (@jamesmunns:beeper.com)> main, in case call site context is relevant: https://gist.github.com/jamesmunns/3313199fee70c012fb82a1ff361ee8c9#file-cmrt-repro-txt-L54-L98

04:09 <re_irc> < (@jamesmunns:beeper.com)> (note: I didn't run this on hardware, can't guarantee this is a repro case, and that upsets me, lemme go grab a dev board)

04:13 <re_irc> < (@jamesmunns:beeper.com)> okay yeah, does repro on hw, that disasm contains whatever bug

04:14 <re_irc> < (@jamesmunns:beeper.com)> (thank you Peter Hansen for such an easy repro setup!)

04:19 <re_irc> < (@jamesmunns:beeper.com)> (also man that generated "udf" asm is funny)

04:19 <re_irc> < (@jamesmunns:beeper.com)> (function prelude, jump to ext asm which

04:20 <re_irc> < (@jamesmunns:beeper.com)> +is two udfs and no return, then call another udf for good measure)

04:30 <re_irc> < (@jamesmunns:beeper.com)> Yeah, I think that opt is only valid when you are align 8

04:32 <re_irc> < (@jamesmunns:beeper.com)> it's basically doing:

04:32 <re_irc> retval = *(slice.as_mut_ptr() | (bool as usize << 2 as *mut _));

04:33 <re_irc> < (@jamesmunns:beeper.com)> which only works when your base addr is 0 and you can add 4 by or-ing the bits instead of adding

04:34 <re_irc> < (@jamesmunns:beeper.com)> abusing orr.w for the pointer math, followed by the "ldrr1, [r1, #0]" (the deref in my example), which then gets packed in by stmia to the return value (I guess the stack dest?)

04:34 <re_irc> < (@adamgreig:matrix.org)> it's fractionally worse than that, even

04:35 <re_irc> < (@adamgreig:matrix.org)> because it puts that two-u32 array on the stack, it starts by decreasing sp by 8, then writes 99 to SP-4, 3 to SP-8, and returns *((SP-8) | (x << 2))

04:35 <re_irc> < (@adamgreig:matrix.org)> but yea, same consequence, only valid when SP is 8-byte aligned

04:36 <re_irc> < (@adamgreig:matrix.org)> I guess it's irrelevant that it has sp-8 because if the array had more than two elements you could ignore the rest when indexing with a bool anyway

04:36 <re_irc> < (@jamesmunns:beeper.com)> ah, yeah, I didn't super grok the first "sub sp #8"

04:36 <re_irc> < (@adamgreig:matrix.org)> but yea, it's that optimisation ORing the SP with x<<2

04:37 <re_irc> < (@jamesmunns:beeper.com)> I guess that's like 6 bytes with the "orr.w + ldr r1 [r1, 0]", vs like 8 for an add?

04:38 <re_irc> < (@jamesmunns:beeper.com)> or I dunno. it looks like it's llvm doing an "assume stack is 8-aligned" operation, not any wrong asm

04:38 <re_irc> < (@jamesmunns:beeper.com)> it's just stickin real hard to aapcs, which we weren't honoring.

04:38 <re_irc> < (@adamgreig:matrix.org)> so yea, SP-8 is the value to return when x is 0, SP-4 is the value to return when x=1, and ORing (x<<2) is like +4 when x==1

04:38 <re_irc> < (@jamesmunns:beeper.com)> (and then stmia as a 3-word memcpy basically

04:38 <re_irc> < (@jamesmunns:beeper.com)> * basically)

04:39 <re_irc> < (@adamgreig:matrix.org)> if SP-8 is 8-byte aligned the bottom 3 bits are 0 so you can always add 4 by ORing 100, if it's only 4 byte aligned then that third bit might already be 1, and ORing another 1 in will have no effect

04:39 <re_irc> < (@adamgreig:matrix.org)> hence always getting 99 returned

04:39 <re_irc> < (@adamgreig:matrix.org)> well, that makes more sense, at least

04:40 <re_irc> < (@adamgreig:matrix.org)> unfortunately yea that can clearly happen on v6/v7/v8 and presumably llvm has had this optimisation for some time

04:40 <re_irc> < (@jamesmunns:beeper.com)> yep

04:40 <re_irc> < (@jamesmunns:beeper.com)> maybe yank those old vers with bad align

04:41 <re_irc> < (@jamesmunns:beeper.com)> it seems super a thing the compiler is able to do and it is clear we are wrong

04:41 <re_irc> < (@adamgreig:matrix.org)> I don't think it's relevant that you _return_ the two-byte array either, in fact it's missing an optimisation opportunity to only write the array into memory once

04:41 <re_irc> < (@adamgreig:matrix.org)> right now it writes it into stack, selects one value or the other, then writes that selected value and the two array values back to the stack after

04:41 <re_irc> < (@adamgreig:matrix.org)> oh yea, they're defo getting yanked

04:41 <re_irc> < (@jamesmunns:beeper.com)> yeah we have no idea the space of what can trigger this

04:41 <re_irc> < (@jamesmunns:beeper.com)> so we have to assume anything can lol

04:43 <re_irc> < (@jamesmunns:beeper.com)> (maybe the limit is how much you can stmia back? I dunno the max number of pushes you can chain for that, but array of two makes sense because you can only abuse 8 aligned bytes. might show up for u8 arrays smaller than "[u8; 8]" too)

04:44 <re_irc> < (@jamesmunns:beeper.com)> anyway, wild guesses, bed time

04:45 <re_irc> < (@jamesmunns:beeper.com)> (okay that "orr.wr1, r2, r1, lsl #2" is actually super cool)

05:07 <re_irc> < (@adamgreig:matrix.org)> I dunno if the stmia matters, you can always just write multiple stm statements and this trick still saved you time

05:08 <re_irc> < (@adamgreig:matrix.org)> the version I compiled doesn't actually stmia at all (I think that's changed on nightly? but not on stable?

05:08 <re_irc> < (@adamgreig:matrix.org)> oh, perhaps it is actually stmia under the hood

05:09 <re_irc> < (@adamgreig:matrix.org)> hmm, no, the stores relating to this optimisation are all individual, because it ended up not allocating the registers in order

05:30 jcroisant has joined #rust-embedded

06:41 fabic has quit [Ping timeout: 252 seconds]

08:11 fabic has joined #rust-embedded

09:12 <re_irc> < (@datdenkikniet:matrix.org)> Came across https: //github.com/rust-embedded/cortex-m-rt/issues/139 which seems to have some relevant info wrt pushing lr

09:13 <re_irc> < (@datdenkikniet:matrix.org)> Came across https://github.com/rust-embedded/cortex-m-rt/issues/139 which seems to have some relevant info when it comes to pushing "lr"

09:59 jcroisant has quit [Quit: Connection closed for inactivity]

10:15 IlPalazzo-ojiisa has joined #rust-embedded

10:45 fabic has quit [Ping timeout: 260 seconds]

10:47 <re_irc> < (@datdenkikniet:matrix.org)> https://psion.agg.io/_matrix/media/r0/download/matrix.org/SWJaMlDCcxeHHBxXkNgCZxqQ/image.png

10:47 <re_irc> < (@datdenkikniet:matrix.org)> Did some digging, and this is what GDB seems to say

10:50 <re_irc> < (@datdenkikniet:matrix.org)> * say. It definitely seems to assume that "lr"will be 0xFFFF_FFFF when handling exceptions/and the like, but I've not quite figured if it always uses that to determine the end of a stack frame

10:50 <re_irc> < (@datdenkikniet:matrix.org)> there's a lot of caching going on so the code is rather tricky to follow

10:51 <re_irc> < (@datdenkikniet:matrix.org)> This (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/arm-tdep.c;h=347f3e6b3077f2c676cc15c170cf62b6b1ea967d;hb=HEAD#l3466) is where I found that

10:52 <re_irc> < (@datdenkikniet:matrix.org)> not super related to how we can fix the issue, but may give some insight as to how/why probe-run assumes that 0xFFFF_FFFF must mean that it's at the end

10:55 <re_irc> < (@datdenkikniet:matrix.org)> Did some digging, and this is what GDB seems to say. It definitely seems to assume that "lr"will be 0xFFFF_FFFF when handling exceptions and the like, but I've not quite figured if it always uses that to determine the end of a stack frame

11:06 <re_irc> < (@datdenkikniet:matrix.org)> Did some digging, and this is what GDB seems to say. It definitely seems to assume that "lr"will be 0xFFFF_FFFF eventually when handling exceptions and the like, but I've not quite figured if it always uses that to determine the end of a stack frame

11:22 IlPalazzo-ojiisa has quit [Remote host closed the connection]

11:31 <re_irc> < (@datdenkikniet:matrix.org)> : I think it ends up being something along the lines of, for GDB at least:

11:31 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:31 <re_irc> 2. If it's not, it keeps going for all cached values of "lp", and each value of "lp" is then used to construct a frame ID. If the frame ID (= eventually "lr" value) is the same for two frames, it's done.

11:31 <re_irc> < (@datdenkikniet:matrix.org)> definitely feels like a probe-run problem that should not have to be fixed in cortex-m-rt

11:31 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

11:31 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:31 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" (a finite amount), and each value of "lp" is then used to construct a frame ID. If the frame ID (= eventually "lr" value) is the same for two frames, it's done.

11:32 <re_irc> < (@datdenkikniet:matrix.org)> definitely feels like a probe-run problem that should not have to be fixed/accounted for in cortex-m-rt

11:32 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

11:32 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:32 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" or "sp" (not entirely sure which, but a finite amount), and each of these values is then used to construct a frame ID. If the frame ID (= eventually "lr" value) is the same for two frames, it's done.

11:33 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

11:33 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:33 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" or "sp" (not entirely sure which, but a finite amount), and each of these values is then used to construct a frame ID. If the frame ID (= eventually "lr" or "sp" value) is the same for two frames, it's done.

11:38 <re_irc> < (@datdenkikniet:matrix.org)> on top of that, the backtrace analysis that "probe-run" does seems to be perfectly capable of detecting #2 as well (takes about 4 lines of code to add), so it wouldn't be difficult to solve either. I don't know how to go about proving/showing that it actually works the way it's supposed to

11:39 <re_irc> < (@datdenkikniet:matrix.org)> https://psion.agg.io/_matrix/media/r0/download/matrix.org/guOpZzdAtDWmjSpiZVqdqaXU/image.png

11:39 <re_irc> < (@datdenkikniet:matrix.org)> PLR = previous LR value

11:43 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:43 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

11:43 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" or "sp" (not entirely sure which, but a finite amount), and each of these values is then used to construct a frame ID. If the frame ID (= directly correlated "lr" or "sp" value) is the same for two frames, it's done.

11:46 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

11:46 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

11:46 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" or "sp" (not entirely sure which, but a finite amount), and each of these values is then used to construct a frame ID. If the frame ID (= directly correlated "lr" or "sp" value) is the same for two frames, it's done because it's reached the outermost value (see here...

11:46 <re_irc> ... (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/frame-unwind.c;h=76601faa4797ae92a51875e53f9f80ab46f47b3a;hb=HEAD#l228)).

11:48 <re_irc> < (@datdenkikniet:matrix.org)> https://psion.agg.io/_matrix/media/r0/download/matrix.org/wHWnNdmfojsfuVKMxxzWMvrE/image.png

11:48 <re_irc> < (@datdenkikniet:matrix.org)> for more context: it actually produces a correct backtrace without pushing "lr" to the stack through "cortex-m-rtic"

11:49 <re_irc> < (@datdenkikniet:matrix.org)> * "cortex-m-rt"

11:50 <re_irc> < (@datdenkikniet:matrix.org)> must note that I've not actually run this example with GDB, so whether it actually does or doesn't work with that I don't know

11:55 <re_irc> < (@datdenkikniet:matrix.org)> this (https://github.com/datdenkikniet/probe-run/tree/fix_lr_backtrace) seems to be all that's required to fix it, but I have no idea _why_ the LR value repeats when it gets to the end.

11:56 <re_irc> < (@datdenkikniet:matrix.org)> * (https://github.com/datdenkikniet/probe-run/commit/90a6d9634116d45829bae3a500b6d497b65191dc5829bae3a500b6d497b65191dc)

11:56 <re_irc> < (@datdenkikniet:matrix.org)> * (https://github.com/datdenkikniet/probe-run/commit/90a6d9634116d45829bae3a500b6d497b65191dc)

11:58 <re_irc> < (@datdenkikniet:matrix.org)> * end, nor if that can be guaranteed somehow.

12:39 dc740 has quit [Remote host closed the connection]

12:39 dc740 has joined #rust-embedded

13:01 <re_irc> < (@datdenkikniet:matrix.org)> to confirm: in GDB, I'm quite certain that this line (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/frame.c;h=c69a3ea0cb08525c90ac91546c32789d6c565e50;hb=HEAD#l2324) should perform a callback to this function (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/frame-unwind.c;h=76601faa4797ae92a51875e53f9f80ab46f47b3a;hb=HEAD#l228) through this callback hook set by the target...

13:02 <re_irc> ... (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/arm-tdep.c;h=347f3e6b3077f2c676cc15c170cf62b6b1ea967d;hb=HEAD#l3936), which just performs #2

13:03 <re_irc> < (@datdenkikniet:matrix.org)> Ah wait, no it doesn't....

13:09 <re_irc> < (@datdenkikniet:matrix.org)> +🤦♂️

13:14 <re_irc> < (@datdenkikniet:matrix.org)> definitely feels like it may be a probe-run problem that should not have to be fixed/accounted for in cortex-m-rt

13:16 <re_irc> < (@datdenkikniet:matrix.org)> I think it ends up being something along the lines of, for GDB at least:

13:16 <re_irc> 1. If "lr" 0xFFFF_FFFF, there's an arm-specific thing that indicates that this is now the end of the frame (as per my screenshot).

13:16 <re_irc> 2. If it's not, it keeps going for all cached values of "lp" or "sp" (not entirely sure which, but a finite amount), and each of these values is then used to construct a frame ID. If the frame ID (= directly correlated "lr" or "sp" value) is the same "outer frame value", it's done (see here (https://sourceware.org/git/?p=binutils-gdb.git;a=blob;f=gdb/frame-unwind.c;h=76601faa4797ae92a51875e53f9f80ab46f47b3a;hb=HEAD#l228)).

13:20 <re_irc> < (@datdenkikniet:matrix.org)> Ah wait, no it doesn't perform #2, but does somehow detect the outer-most frame.... 🤦♂️

13:20 <re_irc> < (@datdenkikniet:matrix.org)> Ah wait, no it doesn't perform #2, but does somehow detect the outer-most frame by comparing it to a static value.... 🤦♂️

13:47 <re_irc> Is there a way to shift all elements in a byte buffer as if it were a bytestream?

13:47 <re_irc> < (@monacoprinsen:matrix.org)> Hello,

13:47 <re_irc> If I shifts the elements individualy by << 1 for example it is not the same since the leading values do not carry over to the next element.

13:47 <re_irc> Right now I would achieve this by converting the buffer to a u32 for example and then performing the shift on that value. However I need it back in a buffer to perform a write operation.

13:47 <re_irc> Must be a more streamlined way.

14:02 <re_irc> < (@jamesmunns:beeper.com)> Maybe see if bitvec has something for that?

14:10 fabic has joined #rust-embedded

14:22 <re_irc> < (@datdenkikniet:matrix.org)> Ah wait, no it doesn't perform #2, but does somehow detect the outer-most frame by comparing it to a static value.... 🤦♂️ I guess it will have the exact same issue then. It also does a "WARN two stack frames were the same"

14:51 <re_irc> < (@datdenkikniet:matrix.org)> Ah wait, no it doesn't perform #2, but does try to somehow detect the outer-most frame by comparing it to a static value.... 🤦♂️ I guess it will have the exact same issue then. It also does a "WARN two stack frames were the same"

15:26 <re_irc> <henrik_alser> : You could use rotate_left (or right) on the array

15:27 <re_irc> <henrik_alser> Or heapless had HistoryBuffer

15:27 <re_irc> <henrik_alser> * has

15:46 <re_irc> < (@jamesmunns:beeper.com)> I think? they want to shift an array as if it was one big integers, e.g. the msb of one u32 becomes the lsb of the next u32 in the array (or the other way around)

15:46 <re_irc> < (@jamesmunns:beeper.com)> * integer,

15:55 emerent has quit [Ping timeout: 252 seconds]

15:55 <re_irc> <henrik_alser> Gotcha! Misunderstood the use case

15:56 emerent has joined #rust-embedded

16:01 <re_irc> < (@monacoprinsen:matrix.org)> : Thanks I will have a look!

16:02 <re_irc> < (@monacoprinsen:matrix.org)> : Correct 😊

16:02 Foxyloxy has joined #rust-embedded

16:06 Foxyloxy___ has quit [Ping timeout: 260 seconds]

16:06 Foxyloxy_ has joined #rust-embedded

16:09 Foxyloxy has quit [Ping timeout: 260 seconds]

17:19 <re_irc> <thejpster> Does anyone here know anything about 109-key Japanese keyboards?

17:20 fabic has quit [Ping timeout: 248 seconds]

17:28 IlPalazzo-ojiisa has joined #rust-embedded

19:56 dc740 has quit [Remote host closed the connection]

20:33 xnor has quit [Ping timeout: 268 seconds]

20:50 <re_irc> <Cory Frenette> Hello! 👋

20:50 <re_irc> I'm trying to get into embedded programming using Rust by writing a "no_std" "embedded-hal" device driver and I'm a little stuck at the moment. Hoping someone here might be able to give me a nudge in the right direction. Details in thread

20:50 <re_irc> <Cory Frenette> - I've got a large number (maybe? I'm new to embedded... It's 60+) of device registers that will be used to configure the behavior of the device.

20:51 <re_irc> - I'm using the bitflags crate to generate struct wrappers/api for the registers

20:51 <re_irc> - Because many combinations of register state are invalid, I want to implement a builder to hold pending configuration changes and validate on calling ".build()" before writing the updated state to their respective registers.

20:51 <re_irc> <Cory Frenette> I'd like to write this to be as generic as possible so I want to avoid heap allocation.

20:51 <re_irc> The problem with this approach: I can't figure out a way to determine which registers contain changes (and therefore need to be written) without allocation. I keep finding a need reach for a trait object shared by the register structs. I definitely feel like I'm doing something wrong here. Are there more idiomatic ways of doing what I'm trying to do or am I just basically going to have to generate an if statement for each...

20:51 <re_irc> ... register struct?

20:56 <re_irc> <thejpster> 60 registers, so give them 1 bit each in a u64?

21:21 <re_irc> <Cory Frenette> true, to represent changed or not, but at the end of the day I'm still going to have to write something like

21:21 <re_irc> interface.write(MyRegisterN)?;

21:21 <re_irc> }

21:21 <re_irc> times 60+, no? If that's what must be done, that's fine, we have macros.

21:22 <re_irc> <Cory Frenette> true, to represent changed or not, but at the end of the day I'm still going to have to write something like

21:22 <re_irc> interface.write(MyRegisterN)?;

21:22 <re_irc> if bit_set {

21:22 <re_irc> }

21:22 <re_irc> times 60+, no? If that's what must be done, that's fine, we have macros.

21:33 <re_irc> <Cory Frenette> In my fantasy solution, I am kind of hoping for an iterator over (owned? borrowed?) register structs or similar, but I think that would require a collection of trait objects

21:57 IlPalazzo-ojiisa has quit [Quit: Leaving.]

23:47 xnor has joined #rust-embedded