#rust-embedded on 2021-12-08 — irc logs at libera.irclog.whitequark.org

00:00 <re_irc> <@adamgreig:matrix.org> it's complicated...

00:00 <re_irc> <@adamgreig:matrix.org> mostly I think there's one set of c-m peripherals per core

00:00 <re_irc> <@adamgreig:matrix.org> and each core sees them at the same address

00:00 <re_irc> <@adamgreig:matrix.org> but some are shared by both cores iirc

00:01 <re_irc> <@adamgreig:matrix.org> and anyway it's not clear how to even conceptualise multicore execution in bare-metal rust right now

00:01 <re_irc> <@9names:matrix.org> 9names: Can confirm that pinning signal-hook to 0.3.10 builds fine, so it's a new regression.

00:03 <re_irc> <@dirbaio:matrix.org> adamgreig: wot

00:03 <re_irc> <@dirbaio:matrix.org> isn't everything at 0xExxx_xxxx core-local?

00:05 <re_irc> <@adamgreig:matrix.org> hmm, you're probably right

00:05 <re_irc> <@adamgreig:matrix.org> I didn't remember if stuff like MPU was shared but I guess it wouldn't make sense

00:06 <re_irc> <@dirbaio:matrix.org> it's not

00:06 <re_irc> <@dirbaio:matrix.org> MPU is checks the core does before the access goes out to the bus

00:07 <re_irc> <@adamgreig:matrix.org> so what you're saying is the cortex-m peripherals need to be given out once per core but the device peripherals need to be given out once per device to be safe...

00:07 <re_irc> <@adamgreig:matrix.org> ...but we might have both cores running code from one elf, or might have one elf per core on the same device....

00:07 <re_irc> <@adamgreig:matrix.org> ....so it's a completely lost cause to try and provide singletons for the device peripherals

00:08 <re_irc> <@dirbaio:matrix.org> yeah it's mega cursed

00:09 <re_irc> <@dirbaio:matrix.org> but again it's up to the definition of "program"

00:10 <re_irc> <@dirbaio:matrix.org> if both cores run the same "program" then you have two "mains"

00:10 <re_irc> <@grantm11235:matrix.org> I'm starting to think that the best option might just be for `take` to be unsafe

00:10 <re_irc> <@dirbaio:matrix.org> so for device peripherals, you have one `Peripherals::take()` that must give out only one instance per program

00:11 <re_irc> <@dirbaio:matrix.org> so it needs to use atomics, or a multicore-sound critical section

00:11 <re_irc> <@dirbaio:matrix.org> and for core peripherals well.... you don't
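
A minimal sketch of the "one instance per program" `take()` dirbaio describes, assuming the target has atomic compare-and-swap. `Peripherals` and `TAKEN` here are hypothetical stand-ins, not types from any real PAC or HAL. ARMv6-M cores such as the rp2040's Cortex-M0+ have no CAS instructions, so there the flag would have to be guarded by a multicore-sound critical section instead.

```rust
use core::sync::atomic::{AtomicBool, Ordering};

/// Hypothetical stand-in for a device peripheral set (not a real PAC type).
pub struct Peripherals {
    _private: (),
}

/// Set once the singleton has been handed out, whichever core asked first.
static TAKEN: AtomicBool = AtomicBool::new(false);

impl Peripherals {
    /// Returns `Some` exactly once per program, even if two cores race here.
    pub fn take() -> Option<Self> {
        // compare_exchange(false -> true) can succeed for only one caller;
        // every later caller sees `true` and gets `None`.
        TAKEN
            .compare_exchange(false, true, Ordering::AcqRel, Ordering::Acquire)
            .ok()
            .map(|_| Peripherals { _private: () })
    }
}
```

The first call returns `Some(..)` and every later call returns `None`, regardless of which core makes it.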

00:11 <re_irc> <@grantm11235:matrix.org> What about device peripherals that are core-local?

00:11 <re_irc> <@adamgreig:matrix.org> you might not even have two mains if the second core starts off and gets turned on by passing it a function to run or something

00:11 <re_irc> <@dirbaio:matrix.org> yeah

00:12 <re_irc> <@dirbaio:matrix.org> the rp-hal folks are doing it that way, it's super cool

00:12 <re_irc> <@dirbaio:matrix.org> you can start core1 from core0, giving it a closure

00:12 <re_irc> <@adamgreig:matrix.org> yea, it seems nice, like starting a thread on a hosted platform

00:12 <re_irc> <@dirbaio:matrix.org> so you can "pass" some singletons from core0 to core1
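
A hosted analogy of "start core1 with a closure and pass it some singletons", using `std::thread` as adamgreig's comparison suggests. This is not the rp-hal API: `Uart` and `spawn_core1` are hypothetical names, and a real second-core launch also has to supply a stack and an entry address. The point is only that moving a singleton into the closure transfers ownership, so the first core can no longer touch it.

```rust
use std::thread;

/// Hypothetical singleton token for a peripheral owned by one core at a time.
pub struct Uart {
    pub id: u8,
}

/// Hosted stand-in for "start core1 from core0, giving it a closure".
pub fn spawn_core1<F, T>(f: F) -> thread::JoinHandle<T>
where
    F: FnOnce() -> T + Send + 'static,
    T: Send + 'static,
{
    thread::spawn(f)
}

/// Moves a `Uart` into the "second core" and returns what it reports back.
pub fn demo() -> u8 {
    let uart = Uart { id: 1 };
    let handle = spawn_core1(move || {
        // `uart` is owned by the closure from here on.
        uart.id
    });
    // Using `uart` here would be a compile error: it has been moved.
    handle.join().unwrap()
}
```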

00:12 <re_irc> <@adamgreig:matrix.org> that's what I can't remember if the h7 can do

00:12 <re_irc> <@adamgreig:matrix.org> you can start the cm4 from the cm7 and v.v., but can you start it at some arbitrary address?

00:12 <re_irc> <@dirbaio:matrix.org> even if the hw can't do it, you can make a runtime that can

00:12 <re_irc> <@adamgreig:matrix.org> I guess you can do it the same way, right

00:12 <re_irc> <@adamgreig:matrix.org> yea

00:12 <re_irc> <@adamgreig:matrix.org> the rp0 only does it because the rom for the second core is a spinloop waiting on a semaphore right?

00:13 <re_irc> <@grantm11235:matrix.org> dirbaio:matrix.org: I think that's how ISRs should work too

00:13 <re_irc> <@dirbaio:matrix.org> in the rp2040 both cores boot, but the ROM traps core1 in a loop until core0 sends it a start addr

00:13 <re_irc> <@adamgreig:matrix.org> nice to have that in the rom though

00:14 <re_irc> <@adamgreig:matrix.org> on the h7 it would depend on what the option bytes were set to and you'd need to dump a little startup code into some random far away flash address that's the cm4 boot

00:14 <re_irc> <@dirbaio:matrix.org> hehe

00:14 <re_irc> <@adamgreig:matrix.org> I guess the HAL could set the flash option bytes but it's a bit surprising

00:15 <re_irc> <@adamgreig:matrix.org> I think the cm7 can kill/reset the cm4 so I guess it could be done

00:15 <re_irc> <@dirbaio:matrix.org> can you have them both boot from the same vectors then __start queries "which core am I"?

00:15 <re_irc> <@adamgreig:matrix.org> in pre_init so they don't fight over initialising sram I guess

00:16 <re_irc> <@dirbaio:matrix.org> yeah, or custom rt

00:16 <re_irc> <@adamgreig:matrix.org> seems a bit unnecessary to have a custom rt just for that

00:16 <re_irc> <@dirbaio:matrix.org> because they'll fight over the stack

00:16 <re_irc> <@dirbaio:matrix.org> yeah :S

00:16 <re_irc> <@adamgreig:matrix.org> would be nice if c-m-rt was a bit more flexible about extending it to prevent people needing to customise it

00:17 <re_irc> <@adamgreig:matrix.org> wish the h7 rm had a bit more detail about the dual core setup, it's not really explained anywhere i've seen

00:17 <re_irc> <@adamgreig:matrix.org> I couldn't say how you even query which core you are

00:17 <re_irc> <@adamgreig:matrix.org> I guess read your own coresight ID tables, heh

00:18 <re_irc> <@dirbaio:matrix.org> the rp2040 has a reg you can read from 0xDxxxx :)

00:19 <re_irc> <@adamgreig:matrix.org> and a datasheet that actually tells you all about the two cores

00:19 <re_irc> <@dirbaio:matrix.org> lol

00:20 <re_irc> <@adamgreig:matrix.org> I guess at least on the h7 they're two different types of core

00:20 <re_irc> <@adamgreig:matrix.org> so you can just read the "am i a cortex-m7" or "am i a cortex-m4"
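
The "am i a cortex-m7 / am i a cortex-m4" check can be sketched by decoding the PARTNO field (bits 15:4) of the Cortex-M CPUID register at `0xE000_ED00`. The part numbers `0xC24` (Cortex-M4) and `0xC27` (Cortex-M7) are the Arm-defined values; the `Core` enum and function name are hypothetical. The decoder is a pure function so it can be tried on a host; on the target the input would come from a volatile read of `CPUID_ADDR`.

```rust
/// Address of the CPUID register in the System Control Block.
pub const CPUID_ADDR: u32 = 0xE000_ED00;

/// Which core we are running on, as far as this sketch can tell.
#[derive(Debug, PartialEq, Eq, Clone, Copy)]
pub enum Core {
    CortexM4,
    CortexM7,
    /// Any other part number, returned unchanged.
    Other(u16),
}

/// Decodes the PARTNO field (bits 15:4) of a raw CPUID value.
pub fn core_from_cpuid(cpuid: u32) -> Core {
    let partno = ((cpuid >> 4) & 0xFFF) as u16;
    match partno {
        0xC24 => Core::CortexM4,
        0xC27 => Core::CortexM7,
        other => Core::Other(other),
    }
}

// On the target (not run here), the raw value would be read with:
// let cpuid = unsafe { core::ptr::read_volatile(CPUID_ADDR as *const u32) };
```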

00:34 <re_irc> <@adamgreig:matrix.org> adamgreig: fine, I looked and found the 36 page app note on the dual core architecture, the 24 page app note on the inter-core communication, and 45-page note on debugging dual-core stuff

00:40 <re_irc> <@grantm11235:matrix.org> What if each hal implemented its own runtime, using `cortex-m-rt` as a "runtime-building toolbox"

00:40 <re_irc> <@dirbaio:matrix.org> sometimes it's the user that wants custom stuff though

00:40 <re_irc> <@dirbaio:matrix.org> like using the HAL under some RTOS

00:41 <re_irc> <@grantm11235:matrix.org> The hal could make the rt feature optional

00:42 <re_irc> <@grantm11235:matrix.org> Then the user could use `cortex-m-rt` to make their own custom rt

00:42 <re_irc> <@adamgreig:matrix.org> some hals more or less already do this, right? they define a default memory.x and depend on cortex-m-rt

00:42 <re_irc> <@adamgreig:matrix.org> user still has to pass the linker flag though

00:43 <re_irc> <@adamgreig:matrix.org> but anyway does it buy you anything? the HAL would decide it's safe if it's providing the reset vector or something?

00:44 <re_irc> <@grantm11235:matrix.org> The hal could take a `fn(hal::Peripherals) -> !` as its main program

00:47 <re_irc> <@grantm11235:matrix.org> And then start a second core the same way that rpi does it

00:56 Foxyloxy has quit [Ping timeout: 256 seconds]

01:04 Foxyloxy has joined #rust-embedded

01:15 troth has quit [Ping timeout: 240 seconds]

01:23 <re_irc> <@firefrommoonlight:matrix.org> Re some of the dual core chat in the `stm32-rs` room and the discussion here: I'm curious how to set up smart abstractions for dual core. I may dive into a project that does this, and will hopefully work through some ideas. It appears that `cortex-m-rt`, in conjunction with specifying the correct addresses in `memory.x`, will let you flash either core, with standalone programs

01:23 <re_irc> <@firefrommoonlight:matrix.org> This isn't really ideal from an application perspective (For the uses I'm envisioning), compared to a single program

01:23 <re_irc> <@firefrommoonlight:matrix.org> So, the open question is, how to set up abstractions so the programs "talk" to each other in an organized, easy-to-code way

01:23 <re_irc> <@adamgreig:matrix.org> it shouldn't be too awful to put together something that will boot both cores from a single elf

01:23 <re_irc> <@adamgreig:matrix.org> be a bit hacky at first though

01:24 <re_irc> <@firefrommoonlight:matrix.org> And avoid race conditions between shared memory, periphs etc

01:24 <re_irc> <@adamgreig:matrix.org> it really might be best to generate two elfs at first

01:24 <re_irc> <@adamgreig:matrix.org> save a lot of questions around how initialisation and statics and vector tables and stuff will work

01:24 <re_irc> <@adamgreig:matrix.org> and probably not much harder to coordinate sharing access (maybe even easier...), but flashing and stuff is more annoying

01:24 <re_irc> <@adamgreig:matrix.org> check out the rp2040 hal, it has some multi-core support already

01:25 <re_irc> <@firefrommoonlight:matrix.org> Good point. I think initial moves might be #1 Setting up a clear code style (With published examples for other people who try!). #2 Set up HAL code for Semaphore periphs etc (eg HSEM on STM32)

01:25 <re_irc> <@firefrommoonlight:matrix.org> So you have a high-level API for the semaphores

01:26 <re_irc> <@adamgreig:matrix.org> adamgreig: for example: add a new flash2 memory to memory.x at 0810_0000, add a new section there for a vector table, construct that table in your code and put its link_section to that new section, and have it point to the address of a function you want the cm4 to run

01:26 <re_irc> <@adamgreig:matrix.org> in principle at that point cm4 will boot and start running that function

01:26 <re_irc> <@adamgreig:matrix.org> (and crucially won't try and do bss/static initialisation)

01:26 <re_irc> <@dirbaio:matrix.org> did anyone else's rust-analyzer just break?

01:26 <re_irc> <@dirbaio:matrix.org> today's nightly is borked

01:26 <re_irc> <@adamgreig:matrix.org> I wonder if the h7 sets the cm4 vtor to match its boot address or if you have to do that yourself

01:27 <re_irc> <@firefrommoonlight:matrix.org> LMK if there's anything I can do to test that. I'm not too good with linking/memory.x!

01:27 <re_irc> <@adamgreig:matrix.org> perhaps it just maps the boot address to the cm4's address 0, so vtor 0 still works fine

01:28 <re_irc> <@adamgreig:matrix.org> do you have the h745zi-q nucleo?

01:28 <re_irc> <@firefrommoonlight:matrix.org> Yes

01:29 <re_irc> <@firefrommoonlight:matrix.org> (Until I desolder the chip to put on a custom board lol)

01:29 <re_irc> <@adamgreig:matrix.org> lol

01:29 <re_irc> <@adamgreig:matrix.org> ugh good luck, i hate de/resoldering lqfp144

01:30 <re_irc> <@firefrommoonlight:matrix.org> I never have, and struggle enough with qfp48, but I'm out of options

01:30 <re_irc> <@adamgreig:matrix.org> i had to do a few after a bom incident at work and i didn't enjoy it

01:31 <re_irc> <@adamgreig:matrix.org> lcsc have some 743 lqfp100 in stock pretty often but no help if you want that dual core goodness

01:32 troth has joined #rust-embedded

01:32 <re_irc> <@firefrommoonlight:matrix.org> Oh! I'm very interested

01:33 <re_irc> <@firefrommoonlight:matrix.org> Showing out now, but will F5 occasionally

01:34 <re_irc> <@dirbaio:matrix.org> lol https://github.com/rust-analyzer/rust-analyzer/issues/10961

01:34 <re_irc> <@adamgreig:matrix.org> https://lcsc.com/product-detail/ST-Microelectronics_STMicroelectronics-STM32H743VIT6_C114409.html

01:35 <re_irc> <@adamgreig:matrix.org> lol, oops

01:51 <re_irc> <@adamgreig:matrix.org> ah pooo, my easy idea is somewhat ruined by how you cannot take the address of a function and put it into a static

01:51 <re_irc> <@adamgreig:matrix.org> this sucks

02:08 <re_irc> <@adamgreig:matrix.org> sweet, it works

02:08 <re_irc> <@adamgreig:matrix.org> firefrommoonlight, still around?

02:09 <re_irc> <@adamgreig:matrix.org> https://github.com/adamgreig/dual-core-demo/blob/master/src/main.rs

02:09 <re_irc> <@adamgreig:matrix.org> I'll spare you the GIF but you can imagine two LEDs blinking on the nucleo, one at half the speed of the other

02:10 rardiol has quit [Ping timeout: 240 seconds]

02:11 <re_irc> <@firefrommoonlight:matrix.org> adamgreig: WOAH

02:11 <re_irc> <@adamgreig:matrix.org> the only interesting bit is this line in memory.x that says `KEEP(*(.flash2.vector_table));` https://github.com/adamgreig/dual-core-demo/blob/master/memory.x#L73-L77

02:11 <re_irc> <@firefrommoonlight:matrix.org> Also nice re that single program!

02:12 <re_irc> <@adamgreig:matrix.org> (plus note that FLASH2 is defined in my memory.x MEMORY section, and I've cheekily hardcoded the top of SRAM2 for the second core's initial stack pointer, but you could easily get that from the linker)

02:14 <re_irc> <@firefrommoonlight:matrix.org> I'm getting `error: could not execute process `target\thumbv7em-none-eabihf\release\h7dc` (never executed)` when clone + run release

02:14 <re_irc> <@dirbaio:matrix.org> missing runner in .cargo/config

02:14 <re_irc> <@adamgreig:matrix.org> "cargo embed --release"

02:14 <re_irc> <@adamgreig:matrix.org> I haven't started using cargo run really

02:15 <re_irc> <@adamgreig:matrix.org> just pushed an update that simplifies the vector table stuff a lot actually

02:15 <re_irc> <@adamgreig:matrix.org> don't really need to be fancy and make a table

02:15 <re_irc> <@adamgreig:matrix.org> just stick stack start and reset vector into the memory.x, much cleaner

02:16 <re_irc> <@adamgreig:matrix.org> very clean https://github.com/adamgreig/dual-core-demo/blob/master/src/main.rs

02:17 <re_irc> <@firefrommoonlight:matrix.org> https://psion.agg.io/_matrix/media/r0/download/matrix.org/zUsMIpHjKnViGWklJQhoSPoW/PXL_20211208_021659775.jpg

02:17 <re_irc> <@adamgreig:matrix.org> I've given you basically a loaded shotgun pointed directly at your feet so it's worth being a little cautious about whether this is better than two separate programs and stuff

02:18 <re_irc> <@adamgreig:matrix.org> but it does seem kinda fun

02:18 <re_irc> <@adamgreig:matrix.org> probe-rs seems to handle flashing the two disjoint parts of flash well too, nice

02:18 <re_irc> <@firefrommoonlight:matrix.org> Hah. I'm still clueless with linking and everything dual core beyond "hello world", but we'll see

02:19 <re_irc> <@adamgreig:matrix.org> the setup in memory.x is pretty reasonable

02:19 <re_irc> <@adamgreig:matrix.org> most of the time it took to get this working was remembering that you need KEEP(...) in the linker script to stop it throwing everything away

02:19 <re_irc> <@firefrommoonlight:matrix.org> Idea is M7 does program logic, UI, realtime audio processing. M4 receives a copy of the microphone data from M7, uses it to adjust the filter coeffs the M7 core uses

02:19 <re_irc> <@firefrommoonlight:matrix.org> So the realtime audio isn't interrupted by the sound-analysis and filter adjustment

02:19 <re_irc> <@firefrommoonlight:matrix.org> So, what needs to pass is mic data, filter coefficients, and some signals like "Filter updated"

02:20 <re_irc> <@adamgreig:matrix.org> seems like you should be able to cook something up

02:20 <re_irc> <@firefrommoonlight:matrix.org> Probably a few ways to do it

02:25 <re_irc> <@adamgreig:matrix.org> hmm, you can really tell the cm7 is dual-issue

02:26 <re_irc> <@adamgreig:matrix.org> the cm4 and cm7 are both calling delay(16_000_000) and in theory the cm7 runs at twice the clock of the cm4

02:26 <re_irc> <@adamgreig:matrix.org> but it's definitely blinking more than twice as fast

02:27 <re_irc> <@adamgreig:matrix.org> HSI is 64MHz, gosh

02:31 <re_irc> <@firefrommoonlight:matrix.org> Same

02:31 <re_irc> <@firefrommoonlight:matrix.org> So, I made a wrong assumption. It's only twice the speed if hclk scaler is 2

02:32 <re_irc> <@firefrommoonlight:matrix.org> I'm not sure what the default is; might be 8

02:32 <re_irc> <@firefrommoonlight:matrix.org> It was twice on my board only because I had it configured to 2

02:32 <re_irc> <@firefrommoonlight:matrix.org> *my firmware

02:32 <re_irc> <@firefrommoonlight:matrix.org> Yours is going at 8x

02:33 <re_irc> <@firefrommoonlight:matrix.org> *actually... I tried once with default of HSI and it was a slow 2x....

02:33 <re_irc> <@firefrommoonlight:matrix.org> On my firmware so, not sure actually why the diff

02:34 <re_irc> <@adamgreig:matrix.org> cpu1 is sys_ck/d1cpre giving sys_d1cpre_ck=rcc_c1_ck, cpu2 is sys_d1cpre_ck/hpre=rcc_hclk2=rcc_c2_ck

02:34 <re_irc> <@adamgreig:matrix.org> so the difference is indeed just the hpre prescaler

02:34 <re_irc> <@adamgreig:matrix.org> (check figure 55)

02:34 <re_irc> <@adamgreig:matrix.org> and actually hpre is /1 by default

02:35 <re_irc> <@adamgreig:matrix.org> and d1cpre=/1 too, so by default both cores run at 64MHz
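
The clock relationship adamgreig reads off the clock tree can be written as a small calculation: the first core runs at `sys_ck / d1cpre` and the second at that result divided by `hpre`. The function name is hypothetical, and the prescalers are passed as plain divisors rather than register encodings.

```rust
/// Returns (cpu1_hz, cpu2_hz) for the divider chain described in the chat:
/// cpu1 = sys_ck / d1cpre, cpu2 = cpu1 / hpre.
/// Both prescalers are plain divisors (1, 2, 4, ...), not register field values.
pub fn core_clocks(sys_ck_hz: u32, d1cpre: u32, hpre: u32) -> (u32, u32) {
    let cpu1_hz = sys_ck_hz / d1cpre;
    let cpu2_hz = cpu1_hz / hpre;
    (cpu1_hz, cpu2_hz)
}
```

With the reset defaults mentioned above (64 MHz HSI, both prescalers /1) this gives 64 MHz for both cores. With a 400 MHz system clock and `hpre` = 2 it gives 400 MHz and 200 MHz, matching firefrommoonlight's PLL setup.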

02:35 <re_irc> <@adamgreig:matrix.org> the reason they blink at different speeds is that the cortex-m7 core can retire more instructions per cycle than the cortex-m4 core

02:36 <re_irc> <@firefrommoonlight:matrix.org> Oh! So it's a quirk of asm::delay?

02:36 <re_irc> <@adamgreig:matrix.org> but I'm a bit surprised it turns out to only be due to that and by default they're the same clock speed, very fun

02:36 <re_irc> <@adamgreig:matrix.org> well, sort of

02:36 <re_irc> <@adamgreig:matrix.org> asm::delay promises to delay for "at least this many clock cycles"

02:36 <re_irc> <@adamgreig:matrix.org> but it might be more, and indeed on cortex-m0/1/3/4 it will be like twice as much

02:37 <re_irc> <@adamgreig:matrix.org> https://github.com/rust-embedded/cortex-m/blob/master/asm/inline.rs#L54-L67

02:37 <re_irc> <@firefrommoonlight:matrix.org> So, I tested with A: PLL with SYSTICK1 = 400Mhz and SYSTICK2 = 200Mhz. I configured the `cortex-m` crate delay, telling it I'm using 400Mhz systick. Got 1Hz flash on the M7 LED, and 1/2Hz flash on M4 LED as expected with those settings

02:37 <re_irc> <@firefrommoonlight:matrix.org> I also tested with default config, (Presumably HSI with no PLL, both at 64MHz as you said), and didn't time it, but got much slower flashing

02:38 <re_irc> <@adamgreig:matrix.org> systick delay does not depend on core type, different to cortex_m::asm::delay

02:38 <re_irc> <@firefrommoonlight:matrix.org> That explains it entirely then

02:38 <re_irc> <@adamgreig:matrix.org> the systick delay counts systicks, which are either the cpu clock or cpu clock /8, but is accurate

02:38 <re_irc> <@adamgreig:matrix.org> cortex_m::asm::delay just counts from 0 to the number of cycles/2, which takes about 3-4 clock cycles on a cortex-m4 but only 2 on a cortex-m7

02:39 <re_irc> <@adamgreig:matrix.org> but it means on that sample code I posted, the difference in flash rates between the two cores is entirely down to the m7 doing more counting per clock cycle, which is pretty wild
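
A rough model of why the two LEDs blink at different rates at the same clock, using the figures quoted above: the busy loop runs `cycles / 2` iterations, and each iteration is assumed to cost about 2 clock cycles on the Cortex-M7 and about 4 on the Cortex-M4. Those per-iteration costs are the chat's estimates, not measured values, and the function name is hypothetical.

```rust
/// Estimated wall-clock duration, in microseconds, of a busy loop that runs
/// `cycles / 2` iterations at `cycles_per_iter` clock cycles per iteration.
pub fn busy_loop_micros(cycles: u32, cycles_per_iter: u32, core_hz: u32) -> u64 {
    let iterations = (cycles / 2) as u64;
    // Widen to u64 before multiplying so the intermediate value cannot overflow.
    iterations * cycles_per_iter as u64 * 1_000_000 / core_hz as u64
}
```

For `delay(16_000_000)` at 64 MHz this model gives 250 000 µs at 2 cycles per iteration and 500 000 µs at 4, so a 2:1 blink ratio with no difference in clock speed.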

02:41 <re_irc> <@firefrommoonlight:matrix.org> Yea! I didn't know it did that!

02:52 emerent has quit [Ping timeout: 265 seconds]

02:54 emerent has joined #rust-embedded

02:58 starblue has quit [Ping timeout: 252 seconds]

03:00 starblue has joined #rust-embedded

03:32 procton_ has quit [Remote host closed the connection]

03:32 procton_ has joined #rust-embedded

03:35 troth has quit [Ping timeout: 252 seconds]

03:49 troth has joined #rust-embedded

04:09 <re_irc> <@xnorman:matrix.org> so i've figured out where i'm overflowing my stack.. haven't tested yet if i hit a hardfault outside of the debugger.. and i'm trying to figure out, what is it that is getting pushed to the stack that takes up so much space

04:09 <re_irc> <@xnorman:matrix.org> my stack is at 0x20000000 and the method just before the segfault i'm getting has sp = 0x2001f9d0..

04:09 <re_irc> <@xnorman:matrix.org> so it seems like i should have a lot of space. any advice on how to figure out what is taking up all that space?

04:34 PyroPeter has quit [Ping timeout: 240 seconds]

04:36 PyroPeter has joined #rust-embedded

05:03 <re_irc> <@jamesmunns:beeper.com> You could use the bt command to get a backtrace, and figure out where all your stack frames start

05:03 <re_irc> <@jamesmunns:beeper.com> One or more of them are probably going to be chonky bois

05:06 <re_irc> <@jamesmunns:beeper.com> One blind guess is you're probably moving/returning some large buffer by value, and the compiler didn't apply an RVO to it, meaning you end up with more instances live at one time than you expect.

05:10 <re_irc> <@xnorman:matrix.org> yeah, it happens a couple of times, one out of this function which calls postcard::from_bytes which also returns a value on the stack.. and, my struct is big.. like 13000 bytes, but that's just about 10x smaller than 0x1f9d0

05:12 <re_irc> <@jamesmunns:beeper.com> Yeah, with how serde works, you can sometimes end up with multiple instances (like 2-4x) when you return structs by value

05:14 <re_irc> <@jamesmunns:beeper.com> There might be a way to extend postcard to fill into a maybeuninit object, but I'm not sure off the top of my head how to do that

05:15 <re_irc> <@xnorman:matrix.org> and.. with maybeuninit, would you use &mut MaybeUninit<T> ?

05:15 <re_irc> <@xnorman:matrix.org> so you wouldn't have to pass down all the values?
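
A sketch of the `&mut MaybeUninit<T>` out-parameter idea, written as a hand-rolled fill rather than anything serde or postcard provides. `Big`, `CAP` and `init_in_place` are hypothetical. The caller owns the storage (for example a `static` or a slot in SDRAM) and the callee writes each field through a raw pointer, so no full-size `Big` temporary is ever built on the callee's stack.

```rust
use core::mem::MaybeUninit;
use core::ptr::{self, addr_of_mut};

/// Capacity of the hypothetical payload buffer.
pub const CAP: usize = 4096;

/// Hypothetical large struct that we do not want to return by value.
pub struct Big {
    pub samples: [u8; CAP],
    pub len: usize,
}

/// Initialises `slot` field by field and returns a reference to the result.
/// At most `CAP` bytes of `src` are copied; the rest of `samples` is zeroed.
pub fn init_in_place<'a>(slot: &'a mut MaybeUninit<Big>, src: &[u8]) -> &'a mut Big {
    let n = src.len().min(CAP);
    let p = slot.as_mut_ptr();
    // SAFETY: `p` points to storage for one `Big`; both fields are fully
    // written below before `assume_init_mut` is called.
    unsafe {
        let samples = addr_of_mut!((*p).samples) as *mut u8;
        ptr::write_bytes(samples, 0, CAP);
        ptr::copy_nonoverlapping(src.as_ptr(), samples, n);
        addr_of_mut!((*p).len).write(n);
        slot.assume_init_mut()
    }
}
```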

05:19 <re_irc> <@xnorman:matrix.org> ahh. there is a PR for it: https://github.com/jamesmunns/postcard/pull/14/files

05:21 <re_irc> <@jamesmunns:beeper.com> Oh! I totally forgot about that!

05:22 tokomak has joined #rust-embedded

05:25 <re_irc> <@xnorman:matrix.org> i wonder, is generic-array needed now that there are const generics?

05:35 <re_irc> <@xnorman:matrix.org> ahh but that is all serialization i think..

08:17 troth has quit [Ping timeout: 252 seconds]

08:31 troth has joined #rust-embedded

09:13 dcz_ has joined #rust-embedded

09:36 <re_irc> <@luojia65:matrix.org> https://psion.agg.io/_matrix/media/r0/download/matrix.org/DuIeoMHwxgpCqPksnKiNRABg/IMG_20211208_172859.jpg

09:36 <re_irc> <@luojia65:matrix.org> https://psion.agg.io/_matrix/media/r0/download/matrix.org/uopBTuqVkESEXqkaEVDXFpWP/IMG_20211208_172929.jpg

09:37 <re_irc> <@luojia65:matrix.org> Just finished is_riscv_feature_detected on linux :)

09:53 tokomak has quit [Ping timeout: 252 seconds]

10:54 rardiol has joined #rust-embedded

11:25 <re_irc> <@texitoi:matrix.org> > unsupported feature m

11:25 <re_irc> <@texitoi:matrix.org> Really?

11:26 <re_irc> <@xiretza:xiretza.xyz> heh, rv64iafdc is an interesting combination, yes

11:31 <re_irc> <@9names:matrix.org> Yeah, that is weird! Is it actually missing M?

13:14 crabbedhaloablut has quit [Ping timeout: 276 seconds]

13:14 loki_val has joined #rust-embedded

13:40 <re_irc> <@thejpster:matrix.org> I thought Linux required G. Isn't G just IMAC?

13:46 <re_irc> <@thejpster:matrix.org> G is IMAFDZicsrZifencei

14:08 <re_irc> <@9names:matrix.org> GC is considered baseline for distro support, the kernel will work with a lot less.

15:05 <re_irc> <@xnorman:matrix.org> James Munns: re our convo last night, the stack size and returning values.. found your post 😀 https://github.com/rust-lang/rust/issues/32966#issuecomment-481659802

15:07 <re_irc> <@jamesmunns:beeper.com> Heh, guaranteed NRVO/RVO has been a pretty big axe of mine to grind

15:09 <re_irc> <@jamesmunns:beeper.com> c.f. https://twitter.com/bitshiftmask/status/1418554845397667840, https://twitter.com/bitshiftmask/status/1159196582140612608

15:10 <re_irc> <@jamesmunns:beeper.com> But yeah, I definitely became aware of it through use of postcard on embedded systems, where I also blew my stack trying to create fairly huge structs from some data source (packets on the wire? data in flash? can't remember)

15:16 <re_irc> <@xnorman:matrix.org> any suggested workarounds? I'll definitely need to grow the size of this data that I'm deserding ..

15:16 <re_irc> <@xnorman:matrix.org> i guess.. would passing down `&mut Option<T>` at least avoid the RVO issue until I get to postcard?

15:19 <re_irc> <@xnorman:matrix.org> i also wonder if i could just move my stack to this big sdram i have.. but I figure I'd have to go with a bootloader approach for that because the stack would need to be setup (there is some memory protection etc initialization for the sdram) before my program starts??

15:21 <re_irc> <@jamesmunns:beeper.com> I think that korken89's MU PR is probably the right approach, though it has gotten stale (my fault: When he first opened it I was way less comfortable/familiar with MU, so I let it bit rot).

15:21 <re_irc> <@jamesmunns:beeper.com> If you're interested in updating it or working on it, I'd be happy to mentor, especially since you have an immediate test case to verify against :D

15:22 <re_irc> <@jamesmunns:beeper.com> Re: your earlier question, I don't think the GenericArray stuff is needed anymore, since Postcard switched to using const generics instead

15:23 <re_irc> <@adamgreig:matrix.org> in theory you can still put your stack on the sdram and move your setup to the pre_init method that runs before c-m-rt initialises bss

15:23 <re_irc> <@xnorman:matrix.org> yeah, i can definitely look into that.. I was thinking that that work was only serde and not deserde but I can give deserde a go if so

15:25 <re_irc> <@xnorman:matrix.org> adamgreig: so pre_init wouldn't need the stack?

15:25 <re_irc> <@jamesmunns:beeper.com> Ah, I'm not sure what it covers anymore. I actually should double check to make sure deser is possible with serde and maybeuninit

15:27 <re_irc> <@adamgreig:matrix.org> you'd have to write a pre_init that didn't use the stack

15:27 <re_irc> <@adamgreig:matrix.org> potentially you could leave the internal sram as the vector table SP, and then in pre_init configure the SDRAM and then update the MSP register

15:27 <re_irc> <@adamgreig:matrix.org> https://docs.rs/cortex-m/latest/cortex_m/register/msp/fn.write.html

15:27 <re_irc> <@adamgreig:matrix.org> wonder if that should really be deprecated

15:27 <re_irc> <@adamgreig:matrix.org> I guess you probably can't call it from rust really

15:28 <re_irc> <@xnorman:matrix.org> adamgreig: I guess it would have to be assembly then?

15:28 <re_irc> <@jamesmunns:beeper.com> Hmm, actually thinking about it, I don't think you can deserialize directly into a MU, since Serde returns everything by value...

15:28 <re_irc> <@adamgreig:matrix.org> in an ideal world your pre_init would be a naked fn with assembly but that requires nightly rust atm

15:28 <re_irc> <@adamgreig:matrix.org> it's a bit cursed for sure

15:28 <re_irc> <@jamesmunns:beeper.com> Alex Norman what does your data actually look like? Do you have some huge array of bytes (like sample data)? Or is it actually a ton of nested structs?

15:29 <re_irc> <@adamgreig:matrix.org> hmm, perhaps you could set up the PSP in SDRAM and swap to using PSP in main for postcard, while using MSP for interrupts

15:29 <re_irc> <@xnorman:matrix.org> jamesmunns:beeper.com: nested structs, some of those are arrays

15:30 <re_irc> <@jamesmunns:beeper.com> Would you be able to link it for me? I might be able to suggest some tweaks to your data model to sidestep the problem

15:31 <re_irc> <@xnorman:matrix.org> adamgreig: I'm not sure what this means.. i'll have to look up psp

15:31 <re_irc> <@adamgreig:matrix.org> cortex-m can have two stacks, MSP and PSP

15:31 <re_irc> <@adamgreig:matrix.org> interrupts always use MSP but your main thread can use either

15:32 <re_irc> <@adamgreig:matrix.org> https://interrupt.memfault.com/blog/cortex-m-rtos-context-switching#stack-pointers-and-usage has some details

15:32 <re_irc> <@adamgreig:matrix.org> but it's not hugely elegant, heh, I guess ideally you'd find a way to not require like 100kB of stack

15:33 <re_irc> <@xnorman:matrix.org> jamesmunns:beeper.com: http://pastie.org/p/1ZKQbW6N0ZbvoxL3mJqztT .. the SchedData is the thing i'm serde/deserd

15:35 <re_irc> <@jamesmunns:beeper.com> hmmm

15:36 <re_irc> <@jamesmunns:beeper.com> I can suggest some hacks to avoid doing it all at once, basically deserializing a chunk at a time, and manually pushing each chunk into the final collection

15:36 <re_irc> <@jamesmunns:beeper.com> it's not super elegant though

15:37 <re_irc> <@xnorman:matrix.org> yeah.. that is one thing i was thinking i could do.. i don't love it, but might-could work

15:38 <re_irc> <@xnorman:matrix.org> I'll edit that, i don't like it .. but would do it if i need to
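
The "deserialize a chunk at a time and push each chunk into the final collection" hack could look roughly like this. It uses a made-up fixed-size record format rather than postcard's wire format, and `Step`, `STEP_BYTES` and `decode_into` are hypothetical names, not taken from the pastie. Only one small record lives on the stack at any moment, while the large destination is owned by the caller.

```rust
/// Hypothetical small record; the real schedule data is not reproduced here.
#[derive(Debug, Default, Clone, Copy, PartialEq)]
pub struct Step {
    pub note: u8,
    pub velocity: u8,
    pub ticks: u16,
}

/// Encoded size of one `Step` in this made-up format.
pub const STEP_BYTES: usize = 4;

/// Decodes a single record from exactly `STEP_BYTES` bytes.
fn decode_step(chunk: &[u8]) -> Step {
    Step {
        note: chunk[0],
        velocity: chunk[1],
        ticks: u16::from_le_bytes([chunk[2], chunk[3]]),
    }
}

/// Decodes records one at a time straight into caller-owned storage and
/// returns how many were written. Trailing partial records are ignored.
pub fn decode_into(bytes: &[u8], out: &mut [Step]) -> usize {
    let mut written = 0;
    for (chunk, slot) in bytes.chunks_exact(STEP_BYTES).zip(out.iter_mut()) {
        *slot = decode_step(chunk);
        written += 1;
    }
    written
}
```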

15:38 <re_irc> <@jamesmunns:beeper.com> Yeah, let me think...

15:39 <re_irc> <@jamesmunns:beeper.com> `16^3` is... whew :D

15:39 rardiol has quit [Ping timeout: 265 seconds]

15:41 <re_irc> <@jamesmunns:beeper.com> It might also be interesting taking something non-serde for a spin, like `zerocopy`: https://docs.rs/zerocopy

15:43 <re_irc> <@xnorman:matrix.org> jamesmunns:beeper.com: cool...i'll check that out. i'm at my day job now but have this all noted for later

15:43 <re_irc> <@xnorman:matrix.org> thanks for your suggestions James Munns and adamgreig !

15:49 <re_irc> <@jamesmunns:beeper.com> I'll keep thinking, but I think without RVO, changes to Serde, or a little bit of hacks (e.g. walking through the bytes, deserializing one larger item at a time), I'm not sure I'll have any magic for you.

15:49 <re_irc> <@jamesmunns:beeper.com> I'm assuming you already have `lto = "full"` and `codegen-units = 1` turned on? Those will help give you the best chance for llvm to apply space-saving opts

15:52 <re_irc> <@xnorman:matrix.org> jamesmunns:beeper.com: i have `lto = true` and `codegen-units = 1`

15:53 <re_irc> <@jamesmunns:beeper.com> Darn. Perils of heuristics based optimizers.

16:29 rardiol has joined #rust-embedded

19:59 <re_irc> <@dirbaio:matrix.org> is it possible to tell Cargo to `[patch]` a path dependency to a *different* path?

20:02 <re_irc> <@dirbaio:matrix.org> this fails

20:02 <re_irc> <@dirbaio:matrix.org> [patch.'./stm32-metapac']

20:02 <re_irc> <@dirbaio:matrix.org> stm32-metapac = { path = "./stm32-metapac-gen/out" }

20:03 <re_irc> <@dirbaio:matrix.org> and this parses but does nothing:

20:03 <re_irc> <@dirbaio:matrix.org> [patch.'file://./stm32-metapac']

20:03 <re_irc> <@dirbaio:matrix.org> stm32-metapac = { path = "./stm32-metapac-gen/out" }

20:03 <re_irc> <@dirbaio:matrix.org> > warning: Patch `stm32-metapac v0.1.0 (/home/dirbaio/embassy/embassy/stm32-metapac-gen/out)` was not used in the crate graph.

20:09 <re_irc> <@adamgreig:matrix.org> Rust 2021 survey is open! https://blog.rust-lang.org/2021/12/08/survey-launch.html

20:10 rektide has joined #rust-embedded

20:14 rektide_ has quit [Ping timeout: 260 seconds]

21:58 dcz_ has quit [Ping timeout: 256 seconds]