#rust-embedded on 2023-04-05 — irc logs at libera.irclog.whitequark.org

2022-02-07 19:20 ChanServ changed the topic of #rust-embedded to: Welcome to the Rust Embedded IRC channel! Bridged to #rust-embedded:matrix.org and logged at https://libera.irclog.whitequark.org/rust-embedded, code of conduct at https://www.rust-lang.org/conduct.html

00:00 rardiol has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

00:40 lehmrob has quit [Ping timeout: 265 seconds]

00:41 lehmrob has joined #rust-embedded

00:41 rardiol has joined #rust-embedded

00:51 dc740 has quit [Remote host closed the connection]

03:15 rardiol has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

03:59 lehmrob has quit [Ping timeout: 268 seconds]

07:11 <re_irc> <@eldruin:matrix.org> "embedded-hal" "1.0.0-alpha.10" (https://github.com/rust-embedded/embedded-hal/releases/tag/v1.0.0-alpha.10) along with new releases of "embedded-hal-async", "embedded-hal-bus" and "embedded-hal-nb" have just been published! Thanks goes to as well as everyone else involved in the discussions 🎉

08:48 lehmrob has joined #rust-embedded

08:52 <re_irc> <@ryan-summers:matrix.org> Does probe-run support a TOML configuration file for specifying specific probes (like Embed.local.toml)?

09:03 <re_irc> <@eldruin:matrix.org> hmm, the chat client is giving me some errors. I hope you saw that there are new "embedded-hal" (https://github.com/rust-embedded/embedded-hal/releases/tag/v1.0.0-alpha.10) and related crates alpha releases

09:04 <re_irc> <@ryan-summers:matrix.org> Very weird. I remember seeing it literally like 15 minutes ago, but now it's gone

09:06 IlPalazzo-ojiisa has joined #rust-embedded

09:06 lehmrob has quit [Ping timeout: 265 seconds]

10:07 rardiol has joined #rust-embedded

10:31 <re_irc> <@maarten2000ha:matrix.org> Are there people here who are able to build the esp edf template on an m1 chip? I am having some issues with it for quite some time and I want to get into embedded programming but it’s quite difficult if I can’t even build the project.

10:31 <re_irc> ✅ rust installed, ✅ cargo-generate, ldproxy, espup, espflash, cargo-espflash installed, ✅ nightly toolchain installed and set as default, ✅espup installation and export, ✅ clean project, ❌ able to build project. gist link of the latest error https://gist.github.com/maarten2000ha/4a0beadb43e204cb98ca9f6b96e38dd8

10:44 <re_irc> <@maarten2000ha:matrix.org> Are there people here who are able to build the esp edf template on an m1 chip? I am having some issues with it for quite some time and I want to get into embedded programming but it’s quite difficult if I can’t even build the project. ✅ rust installed, ✅ cargo-generate, ldproxy, espup, espflash, cargo-espflash installed, ✅ nightly toolchain installed and set as default, ✅espup installation and export, ✅...

10:44 <re_irc> ... clean project, ❌ able to build project. gist link of the latest error https://gist.github.com/maarten2000ha/4a0beadb43e204cb98ca9f6b96e38dd8

10:46 rardiol has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

14:25 tafa has quit [Quit: ZNC - https://znc.in]

14:26 tafa has joined #rust-embedded

14:40 emerent has quit [Ping timeout: 260 seconds]

14:40 emerent has joined #rust-embedded

15:37 <re_irc> <@thejpster:matrix.org> You could try the esp-rs room?

16:04 rardiol has joined #rust-embedded

16:08 cr1901 has quit [Read error: Connection reset by peer]

16:09 cr1901 has joined #rust-embedded

16:37 <re_irc> <@dngrs:matrix.org> : from your log it looks like you're building for xtensa. As far as I know xtensa support is not yet merged in "normal" nightly, and you need espressif's fork to build. Might be worth double checking the instructions (https://esp-rs.github.io/book/installation/installation.html)

16:38 <re_irc> <@dngrs:matrix.org> (skip the RISC-V section and go straight to Xtensa)

17:53 IlPalazzo-ojiisa has quit [Quit: Leaving.]

17:53 IlPalazzo-ojiisa has joined #rust-embedded

18:57 <re_irc> <@dngrs:matrix.org> heads up for anyone experiencing defmt issues, seems to me it's currently best to pin it to "defmt="=0.3.2"" (otherwise you get wire format v4, and "probe-run" hasn't yet been updated to support that)

19:12 fooker has quit [Quit: WeeChat 3.7.1]

19:14 <re_irc> <@heniluci:matrix.org> Hey, hope you are all doing well!

19:14 <re_irc> I've been working on my own embedded rust project (a "no_std" KYBER implementation), and have been trying to run time consistency tests on my micro-controller. As such, I'm currently trying to validate that the number of clock cycles are constant for each of my critical functions, and I can measure the clock cycle time directly through cortex_m::Peripherals

19:14 <re_irc> However, I can't find any bench-marking framework which supports measuring this, or even any projects that take measurements (be they clock or real-world time) of a micro-controller's runtime.

19:14 <re_irc> As such, are there any good libraries/examples on profiling execution time on a micro-controller, and if not, are there any examples on how to write your own bench-marking framework for an embedded system?

19:17 fooker has joined #rust-embedded

19:31 rardiol has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

19:56 <re_irc> <@thejpster:matrix.org> I think you want CYCCNT. Which chip are you using?

20:16 markov_twain has joined #rust-embedded

21:18 <re_irc> <@heniluci:matrix.org> : I'm using an M4 chip; I've been measuring the time

21:18 <re_irc> let mut peripherals = Peripherals::take().unwrap();

21:18 <re_irc> peripherals.DWT.enable_cycle_counter();

21:18 <re_irc> peripherals.DWT.set_cycle_count(0);

21:18 <re_irc> accum += 1;

21:18 <re_irc> time = peripherals.DWT.cyccnt.read();

21:18 <re_irc> This does get me an individual measurement, however, I'm consistently running into problems where it gets optimized away 😢. Moreover, I've been trying to write a proper function testing framework, but that's been more difficult than expected

21:19 <re_irc> <@heniluci:matrix.org> * moreover, I believe I'm using that counter via the "cortex_m" crate

21:20 <re_irc> <@heniluci:matrix.org> This has been my shot at writing a proper benchmarking function; however, it's a bit dirty, and I've been finding it consistently optimizes away my benchmarks, so it's not working rn 😢

21:20 <re_irc> use core::hint::black_box;

21:20 <re_irc> use cortex_m::Peripherals;

21:20 <re_irc> #[inline(never)]

21:20 <re_irc> #[no_mangle]

21:20 <re_irc> fn benchmark<const TRIALS: usize, const ARG_LIST_SIZE: usize, F, T, U>(

21:20 <re_irc> function: F,

21:20 <re_irc> args_list: &[T; ARG_LIST_SIZE],

21:20 <re_irc> ) -> [[u32; TRIALS]; ARG_LIST_SIZE]

21:20 <re_irc> where

21:20 <re_irc> F: Fn(&T) -> U,

21:20 <re_irc> U: Copy,

21:20 <re_irc> {

21:20 <re_irc> let mut peripherals = Peripherals::take().unwrap();

21:20 <re_irc> let mut results = [[0u32; TRIALS]; ARG_LIST_SIZE];

21:20 <re_irc> let mut elapsed_cycles;

21:20 <re_irc> let mut result;

21:20 <re_irc> for trial_index in 0..TRIALS {

21:20 <re_irc> for (arg_index, args) in args_list.iter().enumerate() {

21:20 <re_irc> peripherals.DWT.enable_cycle_counter();

21:20 <re_irc> // Run the function and measure the cycles

21:20 <re_irc> peripherals.DWT.set_cycle_count(0);

21:20 <re_irc> result = function(args);

21:20 <re_irc> elapsed_cycles = peripherals.DWT.cyccnt.read();

21:20 <re_irc> // Prevent the compiler from optimizing away the benchmark code

21:20 <re_irc> black_box(elapsed_cycles);

21:20 <re_irc> black_box(result);

21:20 <re_irc> // hprintln!("Total elapsed clock cycles: {:}", elapsed_cycles);

21:20 <re_irc> results[arg_index][trial_index] = elapsed_cycles;

21:20 <re_irc> }

21:21 <re_irc> }

21:21 <re_irc> results

21:21 <re_irc> }

21:21 <re_irc> <@jamesmunns:beeper.com> Which specific M4 you are using is useful, some vendors (like ST I think?) sometimes need a specific sequence to use DWT if the debugger is not attached

21:21 <re_irc> <@heniluci:matrix.org> : I'm using an XMC4500 microcontroller

21:21 <re_irc> <@heniluci:matrix.org> XMC4500 Relax Lite Kit

21:24 <re_irc> <@heniluci:matrix.org> oh, and as a disclaimer, I can get this working on individual functions

21:25 <re_irc> a) Is there an out-of-the-box solution that already solves this?

21:25 <re_irc> <@heniluci:matrix.org> my big issue rn is:

21:25 <re_irc> b) If not, then how can I stop my own benchmark function from optimizing itself to death?

21:26 <re_irc> <@jamesmunns:beeper.com> > I can get this working on individual functions

21:26 <re_irc> As in you see non-zero DWT counts?

21:29 <re_irc> <@heniluci:matrix.org> https://psion.agg.io/_matrix/media/r0/download/matrix.org/EAWtEBYlFQOSzHSzmeaclzZa/image.png

21:29 <re_irc> <@heniluci:matrix.org> yep; see

21:29 <re_irc> #![cfg_attr(not(test), no_std)]

21:29 <re_irc> // pick a panicking behaviour

21:29 <re_irc> #[cfg(not(test))]

21:29 <re_irc> #![cfg_attr(not(test), no_main)]

21:29 <re_irc> #![cfg_attr(test, allow(unused_imports))]

21:29 <re_irc> #[cfg(debug_assertions)]

21:29 <re_irc> use panic_semihosting as _;

21:29 <re_irc> // release profile: minimize the binary size of the application

21:29 <re_irc> #[cfg(not(test))]

21:29 <re_irc> #[cfg(not(debug_assertions))]

21:29 <re_irc> use panic_abort as _; // requires nightly

21:29 <re_irc> use cortex_m::Peripherals;

21:29 <re_irc> // #[cfg(not(test))]

21:29 <re_irc> #[entry]

21:29 <re_irc> fn main() -> ! {

21:29 <re_irc> // https://community.infineon.com/t5/XMC/XMC4500-measure-elapsed-execution-time/td-p/312597

21:29 <re_irc> let mut peripherals = Peripherals::take().unwrap();

21:29 <re_irc> peripherals.DWT.enable_cycle_counter();

21:29 <re_irc> let mut accum = 0;

21:29 <re_irc> let mut time;

21:29 <re_irc> loop {

21:29 <re_irc> // NOTE: inspecting the assembly of this shows that rust *does* shuffle the ordering of instructions for optimal code execution

21:29 <re_irc> peripherals.DWT.set_cycle_count(0);

21:29 <re_irc> accum += 1;

21:29 <re_irc> time = peripherals.DWT.cyccnt.read();

21:30 <re_irc> hprintln!(

21:30 <re_irc> "Clock count {}: {} -> {}. Time taken is {}",

21:30 <re_irc> accum,

21:30 <re_irc> 0,

21:30 <re_irc> time,

21:30 <re_irc> time

21:30 <re_irc> );

21:30 <re_irc> // Sometimes, the first instruction will take a few more/less cycles

21:30 <re_irc> // if (time_2).abs_diff(time_1 + 6) > 1 {

21:30 <re_irc> // panic!("Warning! Time was not 6!")

21:30 <re_irc> // }

21:30 <re_irc> // panic!("Done!")

21:30 <re_irc> }

21:30 <re_irc> This yields

21:30 <re_irc> <@heniluci:matrix.org> (p.s. sorry, got my messages the wrong way round 😅)

21:32 <re_irc> <@jamesmunns:beeper.com> I dunno what you return, but you could either return "[[(u32, T); TRIALS]; ARG_LIST_SIZE]" and print it/check they are all equal to make sure it is used, or pass in a single "&mut T" or "&mut MaybeUninit<T>" and do a "write_volatile()" to it

21:33 <re_irc> <@jamesmunns:beeper.com> but I don't know anything out of the box for this, sadly

21:39 <re_irc> <@heniluci:matrix.org> : why would the "write_volatile()" solve it in this situation?

21:39 <re_irc> <@heniluci:matrix.org> (sorry if this ends up being a bit of a noob question 😅)

21:39 <re_irc> <@jamesmunns:beeper.com> volatile writes can't be elided by the compiler

21:40 <re_irc> <@jamesmunns:beeper.com> I guess black_box should have the same effect, but adding volatile writes is usually the other thing I do to make sure the compiler doesn't elide something

21:41 <re_irc> <@grantm11235:matrix.org> https://doc.rust-lang.org/core/hint/fn.black_box.html

21:41 <re_irc> <@grantm11235:matrix.org> black_box should work, just be sure to use it for your inputs and your outputs

21:42 <re_irc> <@jamesmunns:beeper.com> Yes, they are already using it on the output of their function, and I'm aware of black box.

21:42 <re_irc> <@jamesmunns:beeper.com> (I didn't know it was stable though now!)

21:45 <re_irc> <@grantm11235:matrix.org> : oops, I guess I should have read more carefully

21:45 <re_irc> <@jamesmunns:beeper.com> You are right though! They are not using it on the inputs, which the docs suggest

21:46 <re_irc> <@jamesmunns:beeper.com> heniluci you could try "result = function(black_box(args));" too :p

21:46 <re_irc> <@heniluci:matrix.org> : ooo, good call

21:47 <re_irc> <@heniluci:matrix.org> I also tried out something else in the meantime

21:47 <re_irc> <@heniluci:matrix.org> and that _may_ have worked?

21:47 <re_irc> <@heniluci:matrix.org> #[inline(never)]

21:47 <re_irc> where

21:47 <re_irc> F: Fn(&T) -> U,

21:47 <re_irc> fn benchmark_single<F, T, U>(function: F, args: &T, peripheral: &mut Peripherals) -> u32

21:47 <re_irc> {

21:47 <re_irc> peripheral.DWT.enable_cycle_counter();

21:47 <re_irc> // Run the function and measure the cycles

21:47 <re_irc> peripheral.DWT.set_cycle_count(0);

21:47 <re_irc> black_box(function(args));

21:47 <re_irc> peripheral.DWT.cyccnt.read()

21:47 <re_irc> }

21:47 <re_irc> // #[no_mangle]

21:47 <re_irc> #[inline(never)]

21:47 <re_irc> fn benchmark<const TRIALS: usize, const ARG_LIST_SIZE: usize, F, T, U>(

21:47 <re_irc> function: F,

21:47 <re_irc> args_list: &[T; ARG_LIST_SIZE],

21:47 <re_irc> ) -> [[u32; TRIALS]; ARG_LIST_SIZE]

21:47 <re_irc> where

21:47 <re_irc> F: Fn(&T) -> U,

21:47 <re_irc> U: Copy,

21:47 <re_irc> {

21:47 <re_irc> let mut peripherals = Peripherals::take().unwrap();

21:47 <re_irc> let mut results = [[0u32; TRIALS]; ARG_LIST_SIZE];

21:47 <re_irc> let mut elapsed_cycles: u32;

21:47 <re_irc> for trial_index in 0..TRIALS {

21:47 <re_irc> for (arg_index, args) in args_list.iter().enumerate() {

21:47 <re_irc> elapsed_cycles = benchmark_single(&function, args, &mut peripherals);

21:47 <re_irc> // Prevent the compiler from optimizing away the benchmark code

21:47 <re_irc> black_box(elapsed_cycles);

21:47 <re_irc> // hprintln!("Total elapsed clock cycles: {:}", elapsed_cycles);

21:47 <re_irc> results[arg_index][trial_index] = elapsed_cycles;

21:47 <re_irc> }

21:47 <re_irc> results

21:48 <re_irc> }

21:48 <re_irc> <@heniluci:matrix.org> tried splitting it into 2 functions, and used "#[inline(never)]" to avoid any mixing

21:48 <re_irc> <@heniluci:matrix.org> but have only tested it against a _very_ basic function

21:51 <re_irc> <@heniluci:matrix.org> : can confirm it _does_ also work with this, but it also increases overhead?

21:51 <re_irc> trial code is:

21:51 <re_irc> fn my_function_with_arg(x: &u32) -> u32 {

21:51 <re_irc> // hprintln!("Run function!");

21:51 <re_irc> // An example function to be benchmarked

21:51 <re_irc> }

21:51 <re_irc> x + 2

21:51 <re_irc> // Run the benchmark

21:51 <re_irc> #[entry]

21:51 <re_irc> fn main() -> ! {

21:51 <re_irc> hprintln!("Starting main!");

21:51 <re_irc> let results_1 = benchmark::<3, 4, _, _, _>(my_function_with_arg, &[1, 2, 3, 4]);

21:51 <re_irc> // let results_2 = benchmark::<10, 3, _, _, _>(my_function_with_args, &[(1, 2), (2, 3), (3, 4)]);

21:51 <re_irc> // ::<10, 3, , (u32, u32), u32>

21:51 <re_irc> hprintln!("Ending benchmark! results were {:?}", results_1);

21:51 <re_irc> loop {

21:51 <re_irc> panic!("Done!");

21:51 <re_irc> }

21:52 <re_irc> <@heniluci:matrix.org> : with the change it measures from 4 clock cycles to 6 clock cycles

21:53 <re_irc> <@heniluci:matrix.org> > [[4, 4, 4], [4, 4, 4], [4, 4, 4], [4, 4, 4]]

21:53 <re_irc> > But all measurements are consistent!

21:53 <re_irc> <@jamesmunns:beeper.com> hard to tell if that's a good thing (more accurate) or a bad thing (more overhead) without looking at the asm

21:53 <re_irc> <@heniluci:matrix.org> true

21:53 <re_irc> <@heniluci:matrix.org> seems like I need to try a few crazier functions to see if I can push it to the limit

21:53 <re_irc> <@heniluci:matrix.org> also

21:53 <re_irc> <@heniluci:matrix.org> I don't know if it works with multiple arguments

21:53 <re_irc> <@heniluci:matrix.org> do you know if there's a way to unpack tuples into function arguments?

21:54 <re_irc> <@heniluci:matrix.org> or something equivilent to that

21:54 <re_irc> <@jamesmunns:beeper.com> nah, rust doesn't have splatting

21:54 <re_irc> <@jamesmunns:beeper.com> you take a fn arg tho, so you could do it with a closure

21:55 <re_irc> <@jamesmunns:beeper.com> like

21:55 <re_irc> let results_1 = benchmark::<3, 4, _, _, _>(|(a, b)| { my_function_with_arg(a, b) }, &[(1, 1), (2, 2), (3, 3), (4, 4)]);

21:56 <re_irc> <@heniluci:matrix.org> : nice, will give that a try

21:57 <re_irc> <@jamesmunns:beeper.com> (specifically, you are taking Fn not fn, the former is a trait, which closures and functions both implement, the latter of which is specifically a function pointer)

21:58 <re_irc> <@jamesmunns:beeper.com> no idea how that'll affect your benchmarking, but syntax wise you can :D

22:01 <re_irc> <@heniluci:matrix.org> hmmm, this does seem to panic on my microcontroller

22:02 <re_irc> <@heniluci:matrix.org> not sure why

22:02 <re_irc> <@jamesmunns:beeper.com> That.... is not expected :p

22:02 <re_irc> <@jamesmunns:beeper.com> nothing in a closure should be a panic by itself

22:03 <re_irc> #![cfg_attr(test, allow(unused_imports))]

22:03 <re_irc> <@heniluci:matrix.org> #![feature(test)]

22:03 <re_irc> #![cfg_attr(not(test), no_main)]

22:03 <re_irc> // pick a panicking behaviour

22:03 <re_irc> #![cfg_attr(not(test), no_std)]

22:03 <re_irc> #[cfg(not(test))]

22:03 <re_irc> #[cfg(debug_assertions)]

22:03 <re_irc> use panic_semihosting as _;

22:03 <re_irc> // release profile: minimize the binary size of the application

22:03 <re_irc> #[cfg(not(test))]

22:03 <re_irc> #[cfg(not(debug_assertions))]

22:03 <re_irc> use panic_abort as _; // requires nightly

22:03 <re_irc> use cortex_m_rt::entry;

22:03 <re_irc> use cortex_m_semihosting::hprintln;

22:04 <re_irc> use core::hint::black_box;

22:04 <re_irc> use cortex_m::Peripherals;

22:04 <re_irc> #[inline(never)]

22:04 <re_irc> fn benchmark_single<F, T, U>(function: F, args: &T, peripheral: &mut Peripherals) -> u32

22:04 <re_irc> where

22:04 <re_irc> F: Fn(&T) -> U,

22:04 <re_irc> {

22:04 <re_irc> peripheral.DWT.enable_cycle_counter();

22:04 <re_irc> // Run the function and measure the cycles

22:04 <re_irc> peripheral.DWT.set_cycle_count(0);

22:04 <re_irc> black_box(function(args));

22:04 <re_irc> peripheral.DWT.cyccnt.read()

22:04 <re_irc> }

22:04 <re_irc> // #[no_mangle]

22:04 <re_irc> #[inline(never)]

22:04 <re_irc> fn benchmark<const TRIALS: usize, const ARG_LIST_SIZE: usize, F, T, U>(

22:04 <re_irc> function: F,

22:04 <re_irc> args_list: &[T; ARG_LIST_SIZE],

22:04 <re_irc> ) -> [[u32; TRIALS]; ARG_LIST_SIZE]

22:04 <re_irc> where

22:04 <re_irc> F: Fn(&T) -> U,

22:04 <re_irc> U: Copy,

22:04 <re_irc> {

22:04 <re_irc> let mut peripherals = Peripherals::take().unwrap();

22:04 <re_irc> let mut results = [[0u32; TRIALS]; ARG_LIST_SIZE];

22:04 <re_irc> let mut elapsed_cycles: u32;

22:04 <re_irc> for trial_index in 0..TRIALS {

22:04 <re_irc> for (arg_index, args) in args_list.iter().enumerate() {

22:04 <re_irc> elapsed_cycles = benchmark_single(&function, args, &mut peripherals);

22:04 <re_irc> // Prevent the compiler from optimizing away the benchmark code

22:04 <re_irc> black_box(elapsed_cycles);

22:04 <re_irc> // hprintln!("Total elapsed clock cycles: {:}", elapsed_cycles);

22:04 <re_irc> results[arg_index][trial_index] = elapsed_cycles;

22:04 <re_irc> }

22:04 <re_irc> results

22:04 <re_irc> }

22:04 <re_irc> // An example function to be benchmarked

22:04 <re_irc> fn my_function_with_args(x: &u32, y: &u32) -> u32 {

22:05 <re_irc> x + y

22:05 <re_irc> // hprintln!("Run function!");

22:05 <re_irc> }

22:05 <re_irc> // An example function to be benchmarked

22:05 <re_irc> fn my_function_with_arg(x: &u32) -> u32 {

22:05 <re_irc> // hprintln!("Run function!");

22:05 <re_irc> x + 2

22:05 <re_irc> }

22:05 <re_irc> // Run the benchmark

22:05 <re_irc> #[entry]

22:05 <re_irc> fn main() -> ! {

22:05 <re_irc> hprintln!("Starting main!");

22:05 <re_irc> let results_1 = benchmark::<3, 4, _, _, _>(my_function_with_arg, &[1, 2, 3, 4]);

22:05 <re_irc> hprintln!("Ending benchmark! results were {:?}", results_1);

22:05 <re_irc> let results_2 = benchmark::<3, 4, _, _, _>(

22:05 <re_irc> |(a, b)| my_function_with_args(a, b),

22:05 <re_irc> &[(1, 1), (2, 2), (3, 3), (4, 4)],

22:05 <re_irc> );

22:05 <re_irc> hprintln!("Ending benchmark! results were {:?}", results_2);

22:05 <re_irc> loop {

22:05 <re_irc> hprintln!("End!");

22:05 <re_irc> panic!("Done!")

22:05 <re_irc> }

22:05 <re_irc> This is my total code rn; it's panicing when it moves into the first loop of the "result_2" iteration

22:05 <re_irc> <@jamesmunns:beeper.com> do you have some kind of array size index?

22:05 <re_irc> <@heniluci:matrix.org> and commenting out all measurement code doesn't solve it, so it's something with running this function

22:05 <re_irc> <@jamesmunns:beeper.com> you could try using iter zip instead of indexing

22:05 <re_irc> <@heniluci:matrix.org> : yh, the 4 in "benchmark::<3, 4, _, _, _>" denotes that it accepts a list of 4 arguments

22:06 <re_irc> <@heniluci:matrix.org> needed it cuz it uses an array for the results, so it needs to know the length at compile time

22:07 <re_irc> <@heniluci:matrix.org> I can make it inferred by adding "#![feature(generic_arg_infer)]", but that doesn't solve the closure problem

22:07 <re_irc> <@jamesmunns:beeper.com> Yeah, I can't really browse the inline code here very well. it might be easier to see in a gist. What panic do you get? my only guess is that you have some kind of indexing error somewhere

22:08 <re_irc> <@heniluci:matrix.org> maybe it's to do with borrowing the closure?

22:08 <re_irc> <@heniluci:matrix.org> "elapsed_cycles = benchmark_single(&function, args, &mut peripherals);" I'm borrowing the function here

22:08 <re_irc> <@jamesmunns:beeper.com> those should be compile errors not runtime errors.

22:10 <re_irc> <@heniluci:matrix.org> true

22:10 <re_irc> <@jamesmunns:beeper.com> fn benchmark<const TRIALS: usize, const ARG_LIST_SIZE: usize, F, T, U>(

22:10 <re_irc> function: F,

22:10 <re_irc> args_list: &[T; ARG_LIST_SIZE],

22:10 <re_irc> where

22:10 <re_irc> ) -> [[u32; TRIALS]; ARG_LIST_SIZE]

22:10 <re_irc> F: Fn(&T) -> U,

22:10 <re_irc> U: Copy,

22:10 <re_irc> {

22:10 <re_irc> let mut peripherals = Peripherals::take().unwrap();

22:10 <re_irc> let mut results = [[0u32; TRIALS]; ARG_LIST_SIZE];

22:10 <re_irc> let mut elapsed_cycles: u32;

22:10 <re_irc> for (arg, trials) in args_list.iter().zip(results.iter_mut()) {

22:10 <re_irc> for trial in trials.iter_mut() {

22:10 <re_irc> trial = black_box(

22:10 <re_irc> benchmark_single(&function, args, &mut peripherals)

22:10 <re_irc> );

22:10 <re_irc> }

22:10 <re_irc> results

22:10 <re_irc> }

22:10 <re_irc> <@jamesmunns:beeper.com> or something like that

22:11 <re_irc> <@heniluci:matrix.org> "panicked at 'called "Option::unwrap()"on a"None" value', examples/time_bench.rs:45:47"

22:11 <re_irc> <@heniluci:matrix.org> error is

22:11 <re_irc> <@heniluci:matrix.org> oh

22:11 <re_irc> <@heniluci:matrix.org> OH

22:11 <re_irc> <@heniluci:matrix.org> I don't think you can unwrap the peripheral twice!

22:11 <re_irc> <@jamesmunns:beeper.com> "let mut peripherals = Peripherals::take().unwrap();"

22:11 <re_irc> <@heniluci:matrix.org> that's it

22:11 <re_irc> <@jamesmunns:beeper.com> lol no

22:11 <re_irc> <@jamesmunns:beeper.com> good catch :p

22:11 <re_irc> <@jamesmunns:beeper.com> either pass it in, or use the unsafe "steal" instead

22:12 <re_irc> <@jamesmunns:beeper.com> (or just pass in the DWT as an arg to benchmark)

22:12 <re_irc> <@heniluci:matrix.org> : oooh, will try that out

22:12 <re_irc> <@heniluci:matrix.org> otherwise, could turn it into a struct which implements a few methods

22:12 <re_irc> <@heniluci:matrix.org> and make it global?

22:13 <re_irc> <@jamesmunns:beeper.com> probably don't make it global

22:14 <re_irc> <@heniluci:matrix.org> : what would stop a user from invoking and creating 2 testing frameworks then?

22:14 <re_irc> <@heniluci:matrix.org> that both try to unwrap peripherals

22:14 <re_irc> <@heniluci:matrix.org> maybe also make it unwrap?

22:17 <re_irc> <@jamesmunns:beeper.com> unwrapping peripherals is usually a "just once at top of main" thing.

22:17 <re_irc> <@jamesmunns:beeper.com> if you write a function that takes the DWT periph, pass it in, or use something like "let dwt = unsafe { &*DWT::ptr() };"

22:17 <re_irc> <@jamesmunns:beeper.com> which is like "steal()" for a single peripheral

22:18 <re_irc> <@jamesmunns:beeper.com> if you write a function that takes the DWT periph, pass it in, or use something like "let dwt = unsafe { &*DWT::PTR };"

22:27 <re_irc> <@grantm11235:matrix.org> By the way, it is possible to make a function that is generic over the number of arguments in another function, but it requires several nightly-only features https://play.rust-lang.org/?version=nightly&mode=debug&edition=2021&gist=a1113c9e34369ac10009bc208b22d64f

22:39 <re_irc> <@peter9477:matrix.org> heniluci: This may not be relevant, and I didn't read back thoroughly here, but I did notice you were not enabling trace on DCB first, and at least on the chip I'm using that appears to be required. I have this code when I start the cycle counter... note the first line and the comment.

22:39 <re_irc> cp.DCB.enable_trace(); // required before enable_cycle_counter() per docs

22:39 <re_irc> cp.DWT.enable_cycle_counter();

22:40 <re_irc> <@peter9477:matrix.org> (If it's actually working without that for you, then I guess this is not relevant for your situation.)

22:40 <re_irc> <@heniluci:matrix.org> : I'll double check in case it does help

22:41 <re_irc> <@peter9477:matrix.org> I think someone earlier mentioned something to the effect that you may need that "if not using debugging (SWD)"... this is probably why I have that, since I do this for stats like you even when not on the probe.

22:41 <re_irc> <@heniluci:matrix.org> : but I know mentioned that some vendors need a specific sequence before using DWT; perhaps it's this you're talking about?

22:41 <re_irc> <@peter9477:matrix.org> yup :)

22:41 <re_irc> <@jamesmunns:beeper.com> : Yeah, this is what I meant

22:42 <re_irc> <@peter9477:matrix.org> I'm guessing... unfortunately I didn't link to the docs that told me that.

22:42 <re_irc> <@jamesmunns:beeper.com> I think most (all?) debuggers will enable it when they attache

22:42 <re_irc> <@jamesmunns:beeper.com> * attach

22:42 <re_irc> <@peter9477:matrix.org> Would that be fairly universal Cortex-M4 behaviour, or vendor-specific?

22:42 <re_irc> <@jamesmunns:beeper.com> you run into the problem when you want to run the same code when the debugger isn't attached

22:43 <re_irc> <@jamesmunns:beeper.com> : I know the sequence varies a bit chip to chip. I remember some working without that call to enable, while some needed it

22:43 <re_irc> <@peter9477:matrix.org> I can't see it mentioned at all in nRF52840 datasheet, so I assume not particular vendor-specific. I must have read it in ARM docs somewhere.

22:43 <re_irc> <@peter9477:matrix.org> * would assume not particularly

22:43 <re_irc> <@jamesmunns:beeper.com> (when the debugger is not attached)

22:45 <re_irc> <@jamesmunns:beeper.com> https://github.com/rtic-rs/rtic/issues/123

22:45 <re_irc> <@jamesmunns:beeper.com> Seems I ran into that on the nrf52 in 2018 :p

22:49 <re_irc> <@peter9477:matrix.org> Ah, found it mentioned here at least... maybe never did find it in ARM docs: https://docs.rs/cortex-m/0.7.7/cortex_m/peripheral/struct.DWT.html#method.enable_cycle_counter

22:50 <re_irc> <@peter9477:matrix.org> > Enables the cycle counter

22:50 <re_irc> > The global trace enable (DCB::enable_trace) should be set before enabling the cycle counter, the processor may ignore writes to the cycle counter enable if the global trace is disabled (implementation defined behaviour).

23:43 IlPalazzo-ojiisa has quit [Remote host closed the connection]