#rust-embedded on 2022-03-22 — irc logs at libera.irclog.whitequark.org

2022-02-07 19:20 ChanServ changed the topic of #rust-embedded to: Welcome to the Rust Embedded IRC channel! Bridged to #rust-embedded:matrix.org and logged at https://libera.irclog.whitequark.org/rust-embedded, code of conduct at https://www.rust-lang.org/conduct.html

02:17 starblue has quit [Ping timeout: 256 seconds]

02:18 starblue has joined #rust-embedded

03:50 starblue has quit [Ping timeout: 252 seconds]

03:51 starblue has joined #rust-embedded

07:05 emerent has quit [Ping timeout: 256 seconds]

07:05 emerent has joined #rust-embedded

07:44 lowq has quit [Ping timeout: 240 seconds]

09:10 diagprov has joined #rust-embedded

10:49 <re_irc> <yruama_lairba> firefrommoonlight: i don't have a SAI device

11:04 starblue has quit [Ping timeout: 256 seconds]

11:06 starblue has joined #rust-embedded

11:20 gsalazar has joined #rust-embedded

11:22 <re_irc> <Etienne Assoumou Mengue> Hi guys, my company and I are embedded enthusiasts. We have been playing/learning/developing with embedded Rust for 2 years on stm32 & nrf52 MCUs. We would like to contribute and be part of the community. Could you guys please guide us on how to do that, and also on how to become a member of the embedded Rust team (Cortex-M team)?

11:22 <re_irc> Thank you.

11:27 <re_irc> <caspinol> Hello all, I'm using the stm32f4xx hal crate and I'm having trouble finding a way to configure a timer running in output compare mode, where the timer drives the external pin directly. I can of course emulate it with an interrupt or by using the PWM, but it's not obvious to me how to do this directly with the timer API. Can anyone share some pointers to the documentation or an example, perhaps?

11:56 <re_irc> <therealprof> Etienne: 👋 Also thanks for your email. I'm replying here because it might be interesting for others as well. We do have a few pointers on how to get started with the embedded WG here: https://github.com/rust-embedded/wg. There are no formal requirements for contributions to our projects, except for abiding by the CoC (https://www.rust-lang.org/policies/code-of-conduct). I would, given your mentioned interest...

11:57 <re_irc> ... in Cortex-M, highly recommend you look around the related projects and just dive in by commenting or creating new issues and PRs. You're also very welcome to join the discussion in our weekly meeting (every Tuesday, i.e. today at 8:00 PM right here) and maybe introduce yourself and give a few details on what you're planning to work on in particular.

12:08 <re_irc> <Imran K> Hello all, I am writing a flash driver for "stm32h723zg", which has "256 bit" data alignment. If I try to write a single byte, it does not work. Can you help me with how to write single-byte data to flash? My board architecture is "32 bit", but flash writes only work with 32-byte-aligned data. Why is that?

13:06 <re_irc> <caspinol> * timer

13:24 cr1901_ has joined #rust-embedded

13:28 cr1901 has quit [Ping timeout: 260 seconds]

13:58 diagprov has quit [Quit: diagprov]

14:17 cr1901_ has quit [Remote host closed the connection]

14:17 cr1901_ has joined #rust-embedded

15:25 creich_ has joined #rust-embedded

15:27 creich has quit [Ping timeout: 240 seconds]

15:54 cr1901_ is now known as cr1901

15:56 lowq has joined #rust-embedded

16:18 starblue has quit [Ping timeout: 256 seconds]

16:20 starblue has joined #rust-embedded

19:01 <re_irc> <adamgreig> hi room, meeting time again! we'll start in 5 min, agenda link on its way...

19:02 <re_irc> <adamgreig> agenda: https://hackmd.io/S90QgkEOTi-tdSHi7MEBtA please add anything you'd like to announce or discuss!

19:06 <re_irc> <adamgreig> ok, let's start! I've only just got my computer back up yesterday so haven't seen anything to announce, did anyone see anything last week?

19:08 <re_irc> <therealprof> I was hoping we could announce a blog post. 😉

19:09 <re_irc> <adamgreig> hah, yea, shall we publish the current newsletter-next this evening then?

19:09 <re_irc> <adamgreig> anyone feel like cutting a PR for it?

19:10 <re_irc> <adamgreig> I'll take a look later otherwise

19:11 <re_irc> <adamgreig> ok, otherwise mostly a few items closed since last week

19:11 <re_irc> <adamgreig> svd2rust got https://github.com/rust-embedded/svd2rust/pull/579 merged which changes array accessors, I guess we're getting closer to a new svd2rust release

19:11 <re_irc> <adamgreig> cortex-m got its prelude (which was just an embedded-hal 0.2 re-export) removed for 0.8, which we're also getting closer to

19:12 <re_irc> <adamgreig> probably push semihosting first in https://github.com/rust-embedded/cortex-m/pull/424 and then maybe consider new cortex-m-rt before c-m though

19:13 <re_irc> <adamgreig> currently there aren't many big changes in 0.8, mostly around the debug peripherals (and of course swapping to stable "asm!()", but it's not breaking), so it might be worth thinking about other changes before cutting it

19:13 <re_irc> <adamgreig> though having not thought about it for a little while i'd need to revisit some old discussions to have any plans live in my head...

19:13 <re_irc> <therealprof> Are there other changes?

19:14 <re_irc> <adamgreig> we remove the deprecated "ptr()" in favour of the const "PTR"

19:14 <re_irc> <adamgreig> but no significant changes to other peripherals, afaik

19:15 <re_irc> <adamgreig> there are a few pending PRs that should definitely get in first, 377/383/387/389/422

19:15 <re_irc> <adamgreig> but they are mostly debug related, finishing up the earlier debug changes

19:16 * cr1901 has nothing to add on his end, so is just catching up on the ARM changes

19:18 * re_irc adamgreig waves at cr1901

19:18 <re_irc> <adamgreig> at least the bridge is working 😓

19:19 <re_irc> <adamgreig> I guess the other thing with cortex-m 0.8 is whether to do the semver hack thing or otherwise try to think of a way around it

19:19 <re_irc> <adamgreig> something something owned singletons, heh

19:20 <re_irc> <newam> Are there any plans to change/remove the singletons in the future?

19:21 <re_irc> <adamgreig> they're such a nuisance, so yea, maybe

19:21 <re_irc> <adamgreig> things we've considered in the past include splitting into a pac and hal crate, or a separate crate just for singletons that won't need version bumping often

19:22 <re_irc> <dirbaio> I'd go singleton-less PAC + singletoned HAL

19:22 <re_irc> <adamgreig> like, imagine cortex-m-singletons gave out instances of DWT, TPIU, NVIC, whatever: we could add new peripherals as Arm adds them, but basically the list is fixed and known, and the HAL crate can do breaking changes whenever it likes, but an application could happily have multiple HALs/HAL versions

19:22 <re_irc> <adamgreig> the problem with singletoned HAL is you now need to enforce a single instance of the HAL?

19:22 <re_irc> <adamgreig> but that's the part that changes most

19:23 <re_irc> <dirbaio> yeah... I've given up on that in embassy

19:23 <re_irc> <adamgreig> indeed

19:23 <re_irc> <adamgreig> perhaps you could have singletons in their own singleton crate and the HAL consumes them, though?

19:23 <re_irc> <adamgreig> if you wanted to keep them, I mean

19:23 <re_irc> <dirbaio> it's "single instance of singletons" xor "allow mixing HAL versions"

19:23 <re_irc> <adamgreig> (and the PAC would also consume them separately/instead of the HAL, if you wanted direct access? or more like, the HAL would consume them via the PAC)

19:24 <re_irc> <dirbaio> the singletons crate might work but then you're committing to a particular way of splitting up the peripherals

19:24 <re_irc> <adamgreig> yea

19:24 <re_irc> <adamgreig> well, you could imagine someone consuming the NVIC singleton and providing 32 new IRQ singletons or whatever

19:24 <re_irc> <adamgreig> but obviously that's going to be less inter-compatible

19:24 <re_irc> <dirbaio> for example in MCU HALs I've found a singleton per gpio pin is wayyyy more ergonomic than a singleton per gpio port

19:25 <re_irc> <dirbaio> and with NVIC there's the similar thing yup

19:25 <re_irc> <adamgreig> sure, but it doesn't reflect the required synchronisation, right?

19:25 <re_irc> <adamgreig> if you have a singleton per pin you can't be sure you have exclusive access to the port registers

19:25 <re_irc> <adamgreig> so you gotta do something else - critical section, platform specific atomic ops, whatever

19:25 <re_irc> <dirbaio> if your registers allows atomic access then yes, otherwise you do critical sections

19:26 <re_irc> <dirbaio> it's a tradeoff

19:26 <re_irc> <adamgreig> of course the practical reality is that all the HALs just have to do this anyway because of course ultimately they offer per-pin singletons
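
A minimal host-side sketch of the tradeoff discussed above, assuming a port register that supports atomic read-modify-write. The names `PORT_ODR`, `Port`, `Pin` and `steal` are hypothetical and do not come from any real HAL; an `AtomicU32` stands in for the memory-mapped register. Each pin is its own singleton, so holding one pin does not give exclusive access to the shared register, and every update has to be atomic (or wrapped in a critical section on hardware without such operations).

```rust
use std::sync::atomic::{AtomicU32, Ordering};

/// Stand-in for a GPIO port's output data register (hypothetical; real
/// hardware would be a memory-mapped register, not a Rust static).
pub static PORT_ODR: AtomicU32 = AtomicU32::new(0);

/// One zero-sized singleton per pin; `N` is the bit index within the port.
pub struct Pin<const N: u8> {
    _private: (),
}

impl<const N: u8> Pin<N> {
    pub fn set_high(&mut self) {
        // Atomic read-modify-write: the bits owned by other pins are left
        // untouched even if another context updates the register concurrently.
        PORT_ODR.fetch_or(1u32 << N, Ordering::Relaxed);
    }

    pub fn set_low(&mut self) {
        PORT_ODR.fetch_and(!(1u32 << N), Ordering::Relaxed);
    }
}

/// Hypothetical per-port singleton that is split into per-pin singletons.
pub struct Port {
    _private: (),
}

impl Port {
    /// Illustration only: creates the port without any uniqueness check.
    pub fn steal() -> Self {
        Port { _private: () }
    }

    /// Consumes the port so it cannot be used alongside its pins.
    pub fn split(self) -> (Pin<0>, Pin<1>) {
        (Pin { _private: () }, Pin { _private: () })
    }
}
```

With a plain load followed by a store instead of `fetch_or`/`fetch_and`, two pins driven from different interrupt priorities could overwrite each other's bits, which is the synchronisation concern raised above.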

19:26 <re_irc> <dirbaio> different HALs might want to do that tradeoff differently, and you can't if you want all HALs to use the same "singletons crate"

19:26 <re_irc> <adamgreig> inside the cortex-m crate(s) though the main thing would be to stop having to do semver hacks and breaking the whole ecosystem with each version bump

19:26 <re_irc> <adamgreig> so even if it was an implementation detail of cortex-m as-is, it might still be a big improvement

19:26 <re_irc> <dirbaio> so I don't think the "singletons crate" idea solves 100% of the problems

19:27 <re_irc> <dirbaio> so I'm not sure it's worth the extra complexity

19:27 <re_irc> <dirbaio> it does solve many

19:27 <re_irc> <dirbaio> dunno

19:27 <re_irc> <newam> dirbaio: Yeah, this is currently a messy problem for the STM32WL where ST expects you to compile two separate binaries for each CPU.

19:28 <re_irc> <dirbaio> oh yeah you can't enforce singleton singletonness at all in the multicore case 😂

19:28 <re_irc> <dirbaio> same for nrf53

19:29 <re_irc> <newam> -two

19:29 <re_irc> <adamgreig> yea, the multi-arch multi-core is a whole other problem too...

19:30 <cr1901> singleton as in e.g. the PERIPHERALS struct from a PAC?

19:30 <re_irc> <adamgreig> so I think we could do the cortex-m-singletons very soon and then later split into cortex-m-hal and cortex-m-pac and at least that might mean that for 0.8 we never need future semver hacks

19:30 <re_irc> <adamgreig> we could even do a 0.7.5 release that used the new -singletons crate

19:30 <re_irc> <adamgreig> which would then become back-compat down to 0.5 or whatever (horrifying thought)

19:30 <re_irc> <adamgreig> cr1901: yea, exactly

19:30 <re_irc> <adamgreig> except the PERIPHERALS struct from cortex-m crate in this case
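
A rough sketch of the singleton pattern being discussed, written for a host target. The types `Nvic`, `Dwt` and `Peripherals` here are hypothetical stand-ins, not the cortex-m crate's definitions, and the real crate may guard the flag differently (for example with a critical section). The point is that the "already taken" flag is a static inside one crate: if two major versions of that crate end up in the same build, each has its own flag and both hand out "the" singletons, which is why a rarely-bumped singletons crate (or the `links` field) comes up.

```rust
use std::sync::atomic::{AtomicBool, Ordering};

/// Hypothetical stand-ins for core peripherals.
pub struct Nvic {
    _private: (),
}
pub struct Dwt {
    _private: (),
}

/// Hypothetical collection of singletons, loosely modelled on a
/// `Peripherals` struct.
pub struct Peripherals {
    pub nvic: Nvic,
    pub dwt: Dwt,
}

/// One flag per copy of this crate in the dependency graph.
static TAKEN: AtomicBool = AtomicBool::new(false);

impl Peripherals {
    /// Returns the singletons on the first call and `None` on every later call.
    pub fn take() -> Option<Self> {
        if TAKEN.swap(true, Ordering::AcqRel) {
            None
        } else {
            Some(Peripherals {
                nvic: Nvic { _private: () },
                dwt: Dwt { _private: () },
            })
        }
    }
}
```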

19:31 <re_irc> <adamgreig> dirbaio: do you just give up on having exclusive access to a peripheral, then, and assume all access needs other synchronisation?

19:31 <cr1901> Yea, I thought we punted on the multi-core case- no way to explain between two cores w/o some sort of runtime that one core owns the singleton

19:31 <re_irc> <adamgreig> or does it just become unsound if your users end up with two versions?

19:32 <re_irc> <adamgreig> cr1901: yea, for now that's the status, it's just "no official support, DIY, it's not yet clear how it might fit into rust send/sync, though people have various ideas around modelling it as separate processes needing IPC except they share a memory space and some singletons..."

19:32 * cr1901 nods

19:34 <re_irc> <dirbaio> adamgreig: gave up on safety when mixing HALs, or multiple major versions of the same HAL. If you don't mix, it's all safe

19:35 <re_irc> <dirbaio> some people do want to mix

19:35 <re_irc> <adamgreig> that's the same as cortex-m except we use the "links" field to break the build if they would have had multiple major versions

19:35 <re_irc> <adamgreig> and it wrecks the ecosystem every time

19:36 <re_irc> <dirbaio> allowing mixing unsafely is better than completely failing the build...

19:36 <re_irc> <newam> cr1901: You can do it at compile time too, if you split up peripherals into individual singletons. Requires a number of hacks though since rust isn't great at handling compile-time options.

19:36 <re_irc> <adamgreig> if we didn't do that, you'd get extremely regular multiple major versions

19:36 <re_irc> <newam> * singletons, and assign each to a core statically.

19:36 <re_irc> <adamgreig> and I suspect you'd end up with a lot of people passing a 0.6 NVIC to a driver that wants 0.8 or whatever on top

19:36 <re_irc> <dirbaio> not many drivers out there taking cortex-m singletons though?

19:37 <re_irc> <dirbaio> usually it's the user configuring the irqs

19:37 <re_irc> <adamgreig> yea, true, mostly the problem is PACs and HALs using old cortex-m and applications wanting to use new cortex-m

19:39 <cr1901> newam: Ack. Maybe one day we'll have a const-fn Peripherals.split for all possible permutations of peripherals :P

19:39 <re_irc> <adamgreig> well that's probably enough ink spilled on cortex-m for tonight, is there anything on embedded-hal?

19:41 <re_irc> <GrantM11235> I just realized it has been a whole week since I started writing a response for https://github.com/rust-embedded/embedded-hal/pull/374 and I still haven't finished it yet

19:42 <re_irc> <GrantM11235> re: performance: I don't think that multiple calls to "write" will inline with opt "s" or "z", will they?

19:43 <re_irc> <therealprof> "depends"

19:44 <re_irc> <GrantM11235> > > This will also allow us to add new methods in the future with default implementations as a non-breaking change.

19:44 <re_irc> > Why is that not the case already, with the current slice-based methods?

19:44 <re_irc> For the most part this is true (aside from possible performance concerns), however there are some cases where this could lead to problems.

19:44 <re_irc> For example, if we add a write_iter method, it could have a default impl using len=1 slices. Unfortunately, it would then be impossible to give write_slice a default impl based on write_iter

19:46 <re_irc> <dirbaio> > impossible to then give write_slice a default impl based on write_iter

19:46 <re_irc> > for that to

19:46 <re_irc> <dirbaio> * for that you'd have to make write_iter mandatory (no default impl) which would be breaking

19:46 <re_irc> <dirbaio> * breaking

19:48 <re_irc> <adamgreig> (newsletter released, thanks eldruin! https://blog.rust-embedded.org/newsletter-31/ 🎉)

19:49 <re_irc> <GrantM11235> But with the word methods, you could add write_iter in a non breaking way that uses write_word, and you could switch the write_slice method to use write_iter by default (also non-breaking)
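
The layering described here could look roughly like the following. This is a sketch under assumptions: `WordWrite` and `SimpleSpi` are hypothetical names for illustration and are not the embedded-hal traits or the contents of the PR. Only `write_word` is required, `write_iter` defaults to a loop over it, and `write_slice` defaults to `write_iter`, so each later method can be added with a default impl without breaking existing implementors.

```rust
/// Hypothetical trait for illustration only; not the embedded-hal API.
pub trait WordWrite {
    type Error;

    /// The single required method.
    fn write_word(&mut self, word: u8) -> Result<(), Self::Error>;

    /// Default built on `write_word`.
    fn write_iter<I>(&mut self, words: I) -> Result<(), Self::Error>
    where
        I: IntoIterator<Item = u8>,
    {
        for word in words {
            self.write_word(word)?;
        }
        Ok(())
    }

    /// Default built on `write_iter`; a DMA-capable HAL would override this.
    fn write_slice(&mut self, words: &[u8]) -> Result<(), Self::Error> {
        self.write_iter(words.iter().copied())
    }
}

/// Mock of a simple non-DMA bus: it only implements `write_word`.
#[derive(Default)]
pub struct SimpleSpi {
    pub sent: Vec<u8>,
}

impl WordWrite for SimpleSpi {
    type Error = core::convert::Infallible;

    fn write_word(&mut self, word: u8) -> Result<(), Self::Error> {
        self.sent.push(word);
        Ok(())
    }
}
```

The mock gets `write_iter` and `write_slice` for free, which is the boilerplate saving that motivates the word-based design.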

19:49 <cr1901> GrantM11235: Rust has a non-zero threshold for inlining even at opt-level="z"

19:50 <re_irc> <dirbaio> and you can tune with "inline-threshold"

19:50 <cr1901> At "s" it's 95, "z" it's "75"

19:50 <cr1901> at "3" it's 285 or 275

19:50 <cr1901> There's a page in the official docs for it

19:50 <re_irc> <dirbaio> i'm not a fan of write_iter either

19:50 <cr1901> dirbaio: Oh, cool, TIL

19:51 <re_irc> <dirbaio> people love doing stuff like "spi.write_iter(slice1.iter().chain(slice2.iter()))"

19:51 <re_irc> <dirbaio> which generates bloated code and performance craters with DMA

19:51 <re_irc> <dirbaio> vs something like "spi.write(slice1); spi.write(slice2);"
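
To make the difference between the two call styles concrete, here is a small host-side sketch. The trait `SliceWrite`, the mock `CountingSpi` and the helpers `send_chained` and `send_slices` are hypothetical and are not embedded-hal APIs. It assumes a bus where every slice write starts one transfer (as a DMA-only peripheral would) and a `write_iter` that falls back to one-word slices. A HAL could instead buffer inside `write_iter`, at the cost of choosing a buffer size.

```rust
/// Hypothetical slice-first trait for illustration; not the embedded-hal API.
pub trait SliceWrite {
    type Error;

    /// Required: write a whole slice in one transfer.
    fn write(&mut self, words: &[u8]) -> Result<(), Self::Error>;

    /// Default: one single-word transfer per item.
    fn write_iter<I>(&mut self, words: I) -> Result<(), Self::Error>
    where
        I: IntoIterator<Item = u8>,
    {
        for word in words {
            self.write(&[word])?;
        }
        Ok(())
    }
}

/// Mock bus that counts transfers and bytes instead of touching hardware.
#[derive(Default)]
pub struct CountingSpi {
    pub transfers: usize,
    pub bytes: usize,
}

impl SliceWrite for CountingSpi {
    type Error = core::convert::Infallible;

    fn write(&mut self, words: &[u8]) -> Result<(), Self::Error> {
        self.transfers += 1;
        self.bytes += words.len();
        Ok(())
    }
}

/// Chaining iterators hides the slice structure from the bus.
pub fn send_chained<S: SliceWrite>(spi: &mut S, a: &[u8], b: &[u8]) -> Result<(), S::Error> {
    spi.write_iter(a.iter().chain(b.iter()).copied())
}

/// Two slice writes keep each buffer contiguous.
pub fn send_slices<S: SliceWrite>(spi: &mut S, a: &[u8], b: &[u8]) -> Result<(), S::Error> {
    spi.write(a)?;
    spi.write(b)
}
```

For a 3-byte and a 2-byte slice, the chained version issues five one-byte transfers in this mock, while the slice version issues two.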

19:51 <Lumpio-> Specialize it for an iterator of slices :-)

19:52 <Lumpio-> ...if only we could specialize

19:53 <re_irc> <therealprof> cr1901: There're multiple levels of inlining...

19:53 <re_irc> <GrantM11235> Sometimes you need to accept an iterator and send it via spi, like in display-interface-spi

19:54 <re_irc> <therealprof> You can also use a slice but since e-g uses iterators...

19:54 <re_irc> <GrantM11235> Or any display driver that uses embedded-graphics

19:54 <re_irc> <dirbaio> on some chips like nrf, you _can't not use DMA_ :P

19:55 <re_irc> <dirbaio> which makes display-interface quite slow on nrf

19:55 <re_irc> <dirbaio> and in async you get horrible performance without DMA as well

19:55 <re_irc> <GrantM11235> dirbaio: Yeah, the hal will need to copy to a buffer, same as when sending data that is in flash instead of ram

19:55 <re_irc> <therealprof> Well, nothing prevents you from specialising the impl for nrf by buffering and using DMA.

19:56 <re_irc> <dirbaio> yeah yeah I know you can workaround

19:56 <re_irc> <dirbaio> but should the e-h traits mandate functionality that _requires_ workarounding?

19:57 <re_irc> <dirbaio> vs if you have slice "write" only

19:57 <re_irc> <dirbaio> you impl write_word, write_iter on top of that

19:58 <re_irc> <dirbaio> you get good performance with the right compiler flags to get "write" to inline

19:58 <re_irc> <therealprof> The problem is if you're coming from an iterator interface you don't know how much data there will be.

19:58 <re_irc> <dirbaio> and you're not encouraging use of apis that are _impossible_ to make fast on some chips or on async

19:58 <re_irc> <GrantM11235> dirbaio: They should allow you to do what you need to do for your driver. If workarounds are required, it is better to do that in the hal IMO

19:59 <re_irc> <dirbaio> the producer of the iterator is in the best position to know the len

19:59 <re_irc> <therealprof> Not really.

19:59 <re_irc> <therealprof> * really.

19:59 <re_irc> <dirbaio> if someone knows, it's whoever created the iter

19:59 <re_irc> <dirbaio> lower layers just know "it's an "impl Iterator""

20:00 <re_irc> <therealprof> If you're drawing a circle, how would you know how many pixels that would create?

20:00 <re_irc> <dirbaio> so higher layers are in the best position to size buffers etc

20:01 <re_irc> <therealprof> Those assumptions can only be wrong.

20:01 <re_irc> <GrantM11235> The higher layers don't even know if a buffer is required at all

20:01 <re_irc> <therealprof> GrantM11235: Exactly.

20:01 <re_irc> <dirbaio> imo _if_ EH has write_word, write_iter they should be default-impl'd on top of write_slice

20:02 <re_irc> <dirbaio> and users who care about performance set the right compiler flags for write_slice to get inlined

20:04 <re_irc> <GrantM11235> dirbaio: I don't think that makes sense for anything other than nrf

20:04 <re_irc> <therealprof> dirbaio: Inlining is not enough to get perfect performance all the time. If you write a slice you will have to determine a usable buffer size, those can only be wrong.

20:05 <re_irc> <therealprof> If you're not using DMA, NOT copying to a buffer is often more efficient.

20:06 <re_irc> <eldruin> I would also see writing slices as the common denominator for default impls but let's not forget that anyone that cares about performance should rather look into a dedicated implementation and not a default.

20:07 <re_irc> <dirbaio> with this, you could have write_word, write_iter with ok performance without dma, and bad performance with dma

20:07 <re_irc> <therealprof> Iterating the buffer can take a significant amount of CPU cycles which you could use to fill the send buffer.

20:07 <re_irc> <dirbaio> if you're using write_iter there's no way to get good performance with dma

20:07 <re_irc> <dirbaio> * performance with

20:07 <re_irc> <dirbaio> but at least it'll work

20:08 <re_irc> <eldruin> * who

20:08 <re_irc> <GrantM11235> eldruin: Exactly. I think that default impls using word methods are perfect for very simple non-dma hals

20:09 <re_irc> <eldruin> I meant having a default impl for write_word and write_iter that just calls write_slice(&[word])

20:10 explore has joined #rust-embedded

20:10 <re_irc> <eldruin> HALs can provide real implementations where the slicing is skipped for performance

20:11 <re_irc> <adamgreig> (oops, we're a bit over time for meeting, thanks everyone! I have to run but feel free to keep discussing this)

20:11 <re_irc> <GrantM11235> The hal probably already has a private write_word function that it uses to impl the e-h trait

20:11 <re_irc> <therealprof> That's possible, but I don't see how that performance would be any better than having a direct implementation of write_iter doing the same internally.

20:11 <re_irc> <eldruin> GrantM11235: exactly

20:12 <re_irc> <dirbaio> even if "write" doesn't get inlined and gets called with len=1, I don't think it'd make a difference for performance

20:12 <re_irc> <dirbaio> it's, what, 20 extra clock cycles?

20:12 <re_irc> <dirbaio> sending one byte over SPI will probably be slower

20:15 <re_irc> <GrantM11235> The main motivation of the pr is so that simple non-dma hals can impl write_word (which they usually already do) and they don't need a bunch of extra "for word in slice" boilerplate

20:16 <re_irc> <dirbaio> but then DMA hals have to impl both "write", and "write_word" with "write(&[word])" boilerplate

20:16 <re_irc> <GrantM11235> A dma hal can easily override write_word with write_slice(&[word])

20:17 <re_irc> <dirbaio> AND the blocking trait becomes inconsistent with the async one

20:18 <re_irc> <GrantM11235> dirbaio: That is true. An async write_word would likely have too much overhead unless the spi speed was very slow, so I wouldn't recommend adding it

20:19 <re_irc> <dirbaio> if it was just the blocking one, i'd be 50/50 (both have boilerplate in some cases, and performance should be the same given sane compiler options)

20:19 <re_irc> <dirbaio> but consistency with async tips the balance in favor of having just slice "write" imo

20:20 <re_irc> <dirbaio> and there's the "encourage users to do the fast option" too

20:21 <re_irc> <dirbaio> slice "write" is fast both with and without DMA

20:21 <re_irc> "write_iter" is slow with DMA, fast without

20:21 <re_irc> <dirbaio> if you make "write_iter", "write_word" first-class stuff you encourage people to write stuff like "spi.write_iter(slice1.iter().chain(slice2.iter()))"

20:22 <re_irc> <dirbaio> due to that I think EH should not have "write_word", "write_iter" at all

20:23 <re_irc> <dirbaio> make the higher-level code write the "for byte in my_iter { spi.write(&[byte]) }"

20:23 <re_irc> <eldruin> maybe we should publish a new alpha with this spi redesign to see how it performs with displays and so on in reality

20:23 <re_irc> <dirbaio> this way you're making it very very obvious that it might be slow on some hardware

20:24 <re_irc> <dirbaio> and encourage using slices _if possible_

20:24 <re_irc> <dirbaio> if not possible (like e-g drawing a circle) then it's not

20:24 <re_irc> <dirbaio> but at least you're not _encouraging_ code that _can't be made fast_

20:24 <re_irc> <dirbaio> * fast on some hardware_

20:25 <re_irc> <dirbaio> if not possible (like e-g drawing a circle) then it's not, let them do N 1-byte writes

20:27 <re_irc> <dirbaio> same goes for linux-embedded-hal, doing a zillion 1-byte writes is sloooooow

20:27 <re_irc> <dirbaio> +too

20:27 <re_irc> <GrantM11235> I was on the fence about it before this meeting, but I think I am mostly convinced now that this is not worth it

20:27 Shell is now known as concha

20:27 concha is now known as Shell

20:29 <re_irc> <dirbaio> :D

20:29 <re_irc> <GrantM11235> You are very persuasive :)

20:30 <re_irc> <GrantM11235> Thanks everyone for the interesting discussion

20:30 <re_irc> <mutantbob> eldruin: Which new SPI design? I bumped into some problems that made me write https://github.com/mutantbob/arduino-spi yesterday.

20:31 <re_irc> <GrantM11235> mutantbob: This (soon to be closed) PR https://github.com/rust-embedded/embedded-hal/pull/374

20:33 <re_irc> <GrantM11235> Has there not been an alpha release since https://github.com/rust-embedded/embedded-hal/pull/351 was merged?

20:33 <re_irc> <eldruin> I meant making a new release including 351

20:33 <re_irc> <eldruin> not yet

20:35 <re_irc> <eldruin> mutantbob: that seems like a different problem caused by an implementation decision in the atmega-hal itself I would say. I know nothing about it though

20:36 <re_irc> <eldruin> Since it is from rahix, I am sure there is some reason behind that, otherwise he is probably grateful for improvements

20:37 <re_irc> <eldruin> Alright, I need to leave now, thanks everybody for the nice discussion!

20:40 <re_irc> <rahix> mutantbob: please check the answer I wrote in your issue, it should hopefully clear things up