#amaranth-lang on 2023-10-02 — irc logs at libera.irclog.whitequark.org

2023-08-14 23:59 whitequark[cis] changed the topic of #amaranth-lang to: Amaranth hardware definition language · weekly meetings: Amaranth each Mon 1700 UTC, Amaranth SoC each Fri 1700 UTC · code https://github.com/amaranth-lang · logs https://libera.irclog.whitequark.org/amaranth-lang · Matrix #amaranth-lang:matrix.org

00:01 <tpw_rules> good news https://github.com/tpwrules/de10_nano_nixos_demo now has amaranth! amaranth is so nice

00:11 chaoticryptidz has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

00:17 Degi has quit [Ping timeout: 264 seconds]

00:20 Degi has joined #amaranth-lang

00:27 <whitequark[cis]> <tpw_rules> "hm i remember amaranth output..." <- it's going to get worse before it gets better, basically

00:27 <whitequark[cis]> <tpw_rules> "good news https://github.com/..." <- niiiiice

01:28 chaoticryptidz has joined #amaranth-lang

01:48 chaoticryptidz has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

01:50 chaoticryptidz has joined #amaranth-lang

02:26 urja has quit [Read error: Connection reset by peer]

02:43 urja has joined #amaranth-lang

06:57 GenTooMan has quit [Ping timeout: 260 seconds]

06:59 notgull has quit [Ping timeout: 255 seconds]

07:02 notgull has joined #amaranth-lang

07:30 GenTooMan has joined #amaranth-lang

07:32 Darius has joined #amaranth-lang

07:50 Darius has quit [Ping timeout: 246 seconds]

07:51 Darius has joined #amaranth-lang

08:36 sauce has joined #amaranth-lang

14:30 GenTooMan has quit [Ping timeout: 272 seconds]

15:08 GenTooMan has joined #amaranth-lang

15:16 GenTooMan has quit [Ping timeout: 260 seconds]

15:48 <_whitenotifier-f> [rfcs] whitequark opened pull request #27: Testbench functions for the simulator - https://github.com/amaranth-lang/rfcs/pull/27

15:49 <_whitenotifier-f> [rfcs] whitequark edited pull request #27: Testbench functions for the simulator - https://github.com/amaranth-lang/rfcs/pull/27

15:58 jjsuperpower has quit [Ping timeout: 272 seconds]

16:30 GenTooMan has joined #amaranth-lang

17:02 <whitequark[cis]> good morning everyone, today is the time for our regular weekly meeting

17:02 <whitequark[cis]> er. evening. good evening. well, for some it's a morning

17:03 <whitequark[cis]> similar to the previous time, today we have one RFC on the agenda: https://github.com/amaranth-lang/rfcs/pull/27 Testbench functions for the simulator

17:03 <whitequark[cis]> rendered: https://github.com/whitequark/amaranth-rfcs/blob/simulator-testbenches/text/0000-simulator-testbenches.md

17:04 <whitequark[cis]> who is in attendance today?

17:05 <galibert[m]> Me

17:06 Chips4MakersakaS has joined #amaranth-lang

17:06 <Chips4MakersakaS> Me too

17:06 jfng[m] has joined #amaranth-lang

17:06 <jfng[m]> me

17:08 <crzwdjk> Me

17:09 <Chips4MakersakaS> To be honest I have never come to the point where I could grasp what the yield in a testbench actually does; I did use cocotb on Verilog output. As a consequence I'm afraid this RFC is above my head.

17:10 <galibert[m]> yield sucks, it means four different things depending on the context

17:11 <whitequark[cis]> Chips4Makers (aka Staf Verhaegen): I see

17:11 <whitequark[cis]> do you mean yield in general, or the specific case of bare yield?

17:11 <whitequark[cis]> galibert: though so does `await`

17:12 <galibert[m]> Yeah, it’s no better

17:12 <galibert[m]> And the performance is impressively low

17:12 <whitequark[cis]> I think it is better; at least now you have the same syntax for yielding control to the simulation or to the OS, and it eliminates one of the forms (there's no bare await)

17:12 <Chips4MakersakaS> yield in the context of an Amaranth testbench, not general python, I do make generators in my python code.

17:13 <whitequark[cis]> I'm not sure where the performance claims come from considering we don't have async testbenches

17:13 <whitequark[cis]> Chips4Makers (aka Staf Verhaegen): are you familiar with coroutines?

17:13 <Chips4MakersakaS> yes

17:14 <whitequark[cis]> so yield val and yield val.eq(other) are essentially just a way to avoid global state by delegating an operation to "the caller of this coroutine", if this makes any sense

17:15 <whitequark[cis]> the caller of the coroutine (the simulator) knows what it is, so when the coroutine returns a value or an Assign statement as a value, it updates its state

17:15 <whitequark[cis]> without that we'd have to define some sort of implicit global for "the current simulator" which doesn't feel especially good
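
A minimal sketch of the delegation described above, in plain Python rather than Amaranth. ToySignal, ToyAssign and run_process are hypothetical names made up for illustration; the real simulator is more involved. The point is only that the generator never touches simulator state itself: it yields a request, and whoever drives the generator services it.

```python
# Toy model only: ToySignal, ToyAssign and run_process are hypothetical names,
# not part of Amaranth.
class ToyAssign:
    def __init__(self, signal, value):
        self.signal = signal
        self.value = value


class ToySignal:
    def __init__(self, name):
        self.name = name

    def eq(self, value):
        # Build a request object; nothing is written until the driver sees it.
        return ToyAssign(self, value)


def run_process(process, state):
    """Drive a generator, servicing each yielded request against `state`."""
    reply = None
    try:
        while True:
            request = process.send(reply)
            if isinstance(request, ToyAssign):
                state[request.signal.name] = request.value  # write request
                reply = None
            elif isinstance(request, ToySignal):
                reply = state[request.name]  # read request
            else:
                raise TypeError(f"unsupported request: {request!r}")
    except StopIteration:
        return state


def testbench(a, b):
    yield a.eq(5)         # ask the driver to store 5 in a
    seen = yield a        # ask the driver for the current value of a
    yield b.eq(seen + 1)  # use the answer to compute a new write


a, b = ToySignal("a"), ToySignal("b")
final_state = run_process(testbench(a, b), {"a": 0, "b": 0})
```

Because the state dictionary is owned by run_process, two drivers can run the same testbench function against different states without any shared global.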

17:19 <Chips4MakersakaS> Main difficulty for me is how time and the clock advance with the yields. With cocotb you could explicitly wait for rising edge or for a certain amount of time. But have to admit that is also rusty ATM.

17:19 <whitequark[cis]> right, so this is actually what this RFC is about

17:20 <whitequark[cis]> you can wait for a rising edge with yield Tick("domain") (and a future RFC will allow waiting for a rising edge on an arbitrary signal)

17:22 <galibert[m]> The performance claim comes from comparing the result of a reimplementation of the 6502 pla and comparing its output with probes in perfect6502. Trying all 2**14 values was sub-second for perfect6502 and minute-range for the sim in amaranth

17:22 <galibert[m]> abysmal

17:23 <whitequark[cis]> that sort of stuff is what cxxsim is for

17:23 <whitequark[cis]> the Python simulator in Amaranth is actually something like 2x faster than Migen's while being more flexible

17:24 <galibert[m]> Oh I'm not saying migen is in any way good

17:24 <whitequark[cis]> the simulator is optimized for its very low startup latency: you can start evaluating against your netlist almost immediately, without any compilation step

17:24 <whitequark[cis]> (and portability)

17:25 <whitequark[cis]> anyway, discussion of performance isn't on topic since that's not what the RFC is about; it's about usability

17:25 <tpw_rules> (which will benefit any future simulator too, correct?)

17:26 <whitequark[cis]> yes, this is a general change in the simulator interface

17:28 <jfng[m]> merge; from a purely ergonomic POV, i think this is worth the increase in API surface

17:28 <jfng[m]> i currently end up using `yield; yield Settle()` a lot in synchronous testbenches, and having to think about it is a bit tedious

17:29 <whitequark[cis]> it also prevents us from adding yield from fifo.read() or such

17:31 <Chips4MakersakaS> So still learning. How would a testbench look for a multiple clock domain design?

17:31 zyp[m] has joined #amaranth-lang

17:31 <zyp[m]> while we're updating the simulator interface, would it be worthwhile to explore whether using await in place of some of the yields would be more ergonomic?

17:32 <whitequark[cis]> zyp: that would be the topic of another (planned) RFC

17:33 <whitequark[cis]> I'm not entirely sure what is the viable approach there yet

17:33 <whitequark[cis]> you have three options: async def, async def generator, and def generator

17:33 <whitequark[cis]> all three can potentially be mixed

17:34 <whitequark[cis]> it is unclear what are the costs of allowing such mixing, and what are the costs of adding e.g. a separate AsyncSimulator or the like

17:34 * galibert[m] posted a file: (4KiB) < https://catircservices.org/_matrix/media/v3/download/matrix.org/POuolfqPvJelmYwQvxjVJyQe/ttest.py >

17:34 <galibert[m]> Chips4Makers (aka Staf Verhaegen):

17:34 <whitequark[cis]> backwards compatibility is important, but preserving the existing level of performance is pretty valuable too, and it's not clear that adding compatibility shims won't make it unacceptable

17:34 <zyp[m]> would the other RFC deprecate the new interface this RFC proposes, or would they coexist?

17:35 <whitequark[cis]> add_testbench is here to stay, just like add_sync_process

17:35 <galibert[m]> (or you mean in the specific case of add_testbench?)

17:35 <whitequark[cis]> there are use cases you cannot practically achieve without either

17:35 <whitequark[cis]> i.e. you cannot write testbenches (especially not while abstracting out functions) without add_testbench, and you cannot replace RTL with behavioral code without add_sync_process

17:36 <zyp[m]> what's the value of add_sync_process if Settle is deprecated?

17:36 <whitequark[cis]> (not without adding a lot of overhead and complexity that makes it impractical to emulate one with the other)

17:36 <whitequark[cis]> add_sync_process lets you pretend to be a flop

17:36 <Chips4MakersakaS> galibert: TY

17:37 <whitequark[cis]> the ability to observe comb output values "just before" the clock edge is something you need to be able to easily replace synchronous logic with a Python function

17:38 <whitequark[cis]> or in other words: two add_testbench waiting on the same clock edge will race (it is undefined which order they are evaluated at), two add_sync_process waiting on the same clock edge are OK (they are evaluated simultaneously and see the same values even if they use .eq in the body)

17:39 <zyp[m]> and that use of add_sync_process doesn't need Settle?

17:40 <whitequark[cis]> no? using Settle there completely breaks it, in fact

17:40 <whitequark[cis]> (because now there will be a race)
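
The difference between "evaluated simultaneously" and "will race" can be sketched with a toy model in plain Python. This is not how the Amaranth simulator is implemented; step_simultaneous, step_immediate and the two process functions are hypothetical names, and the model only assumes that flop-like processes sample a shared pre-edge snapshot while racing processes see each other's writes.

```python
def step_simultaneous(state, processes):
    # Every process samples the same pre-edge snapshot; writes are committed
    # together afterwards, so evaluation order cannot matter.
    snapshot = dict(state)
    pending = {}
    for process in processes:
        pending.update(process(snapshot))
    return {**state, **pending}


def step_immediate(state, processes):
    # Each process sees the writes of the ones that ran before it, so the
    # result depends on the order in which they happen to run.
    state = dict(state)
    for process in processes:
        state.update(process(state))
    return state


def a_takes_b(values):
    return {"a": values["b"]}


def b_takes_a(values):
    return {"b": values["a"]}


start = {"a": 1, "b": 2}
swap = step_simultaneous(start, [a_takes_b, b_takes_a])
swap_reversed = step_simultaneous(start, [b_takes_a, a_takes_b])
racy = step_immediate(start, [a_takes_b, b_takes_a])
racy_reversed = step_immediate(start, [b_takes_a, a_takes_b])
```

In this model the simultaneous step swaps a and b regardless of ordering, while the immediate step produces a different result for each ordering, which is the race.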

17:42 <zyp[m]> okay, I haven't used the amaranth simulator enough to have a clear picture of how everything fits together yet

17:43 <zyp[m]> but I've tried to model DDR IO registers in the migen simulator, and that sounds like it'd be a lot easier in amaranth :)

17:45 <whitequark[cis]> we are approaching the end of our hour long slot

17:45 <whitequark[cis]> does anyone here have comments on the technical substance of the RFC?

17:46 <Chips4MakersakaS> Not me.

17:46 <whitequark[cis]> (aside from jfng who suggests merging)

17:47 <whitequark[cis]> I'm wary of merging an RFC that nobody understands, but the issue is partly that nobody understands it because the current system is incredibly confusing and hard to teach

17:47 <crzwdjk> Seems to make things easier for my use case of writing testbenches. So I would vote merge as well.

17:47 <galibert[m]> I don't use add_sync_process because I rarely have only one clock domain, so abstain

17:47 <whitequark[cis]> galibert: it applies exactly the same to you calling `yield Tick()`

17:47 <whitequark[cis]> i.e. you won't need to use yield Tick(); yield Settle()

17:47 <galibert[m]> (it's close to impossible to emulate an old processor without at least two phases)

17:48 <galibert[m]> I... don't?

17:48 <whitequark[cis]> then it sounds like your use case will be largely or entirely unaffected

17:49 <whitequark[cis]> (thinking about how you would use a multi-phase setup, it probably isn't subject to the issues this RFC is attempting to solve)

17:52 <whitequark[cis]> I think I'm going to end the meeting here; we have a weak consensus towards merge but I'm wary of merging something so few people actually understand

17:52 <whitequark[cis]> I'm going to revisit it next time so it would be great if zyp and Chips4Makers (aka Staf Verhaegen) would be able to look into the current and proposed mechanics?

17:52 <whitequark[cis]> Chips4Makers (aka Staf Verhaegen): happy to have a 1-1 with you to get into the gnarly details if that's something you'd have time for

17:54 <Chips4MakersakaS> I will have to see.

17:54 <whitequark[cis]> all right

17:54 <whitequark[cis]> that's it from me for today then

17:55 <_whitenotifier-f> [rfcs] whitequark commented on pull request #27: Testbench functions for the simulator - https://github.com/amaranth-lang/rfcs/pull/27#issuecomment-1743485923

20:06 jjsuperpower has joined #amaranth-lang

22:04 <mcc111[m]> So I've just learned that the Cyclone V has onboard memory ("BRAM" units?). Does Amaranth have the ability to allocate BRAM directly?

22:05 Wanda[cis] has joined #amaranth-lang

22:05 <Wanda[cis]> via the Memory class, yes

22:05 <Wanda[cis]> it's... in a bit of a rough shape unfortunately, but there are plans for fixing it

22:07 <Wanda[cis]> generally for block RAMs you have the options of either instantiating the underlying vendor primitive manually (Instance in amaranth terms) and having fun connecting the myriad wires, or letting the HDL pick a memory primitive for you (which, in amaranth, means using Memory)

22:08 <Wanda[cis]> and which one you choose basically depends on how "standard" your needs are

22:09 <Wanda[cis]> (if you don't have experience with this, go for Memory and let's hope it's not broken in some funny way in the Intel flow)

22:10 <mcc111[m]> OK, interesting

22:10 <mcc111[m]> At the moment I'm running on the Pocket which doesn't use the "real" intel platform, so I might have to allocate/instantiate it in Verilog and somehow plug it into the amaranth core from there

22:11 <Wanda[cis]> the Intel platform code isn't involved here

22:12 <Wanda[cis]> Memory gets translated to target-independent Verilog code

22:12 <Wanda[cis]> which ... Quartus will hopefully be able to make sense of, but it's a touchy area

22:13 <Wanda[cis]> (there are plans for actually having the vendor cell hooked up by amaranth platform code instead of letting Verilog synthesizer do this, but we're not doing that yet)

22:14 adamgreig[m] has joined #amaranth-lang

22:14 <adamgreig[m]> ime Memory should work fine

22:15 <adamgreig[m]> though.... not sure I've tried synthesising it since yosys's big memory processing updates

22:16 <Wanda[cis]> in this case it's a question of what Quartus is doing, not yosys

22:16 <Wanda[cis]> (well... a bit of both, since yosys is tasked with emitting the Verilog)

22:17 <tpw_rules> fwiw i like to use quartus's IP designer and just make a memory block and use it through an Instance

22:17 <tpw_rules> it's in some ways the worst of both worlds but it is guaranteed to work

22:17 <tpw_rules> maybe i'm just overly cautious too. but that's how i do it for big important memories at least

22:19 <Wanda[cis]> you know

22:19 <Wanda[cis]> it may just work

22:20 <Wanda[cis]> but ... yeah, pushing memories through Verilog is extraordinarily fragile

22:21 <Wanda[cis]> it's basically compiling down the description of your memory into imperative code, then having the synthesis tool pattern-match and decompile what you wrote back into something it understands

22:21 <tpw_rules> quartus also nicely guides you through it especially if BRAM is an unfamiliar term

22:22 <adamgreig[m]> i have way too many memories to click through a wizard and make an instance for all of them, though

22:22 <tpw_rules> yeah totally fair. but if it's your first time even hearing the term that really isn't a bad way to do it

22:22 <adamgreig[m]> for ecp5 dual port read+write and some other edge cases i've manually instantiated the ecp5 primitives, but mostly Memory has worked well

22:23 <adamgreig[m]> yea for sure, even if you just use it to find out all the settings and things and then throw it out

22:23 <Wanda[cis]> so whether it works depends on whether Quartus is happy with the Verilog code patterns that yosys emits

22:24 <tpw_rules> fwiw too Quartus also seemed happy with whatever litex did

22:24 <Wanda[cis]> litex is migen though

22:24 <Wanda[cis]> doesn't go through yosys verilog emitter

22:25 <tpw_rules> ah

22:25 <tpw_rules> knew the first, not the second

22:27 <Wanda[cis]> FWIW memory pattern recognition code (for Verilog into yosys direction) is some of the most cursed shit I wrote; at some points it basically guesses what you meant and asks a SAT solver to prove that its guess is correct

22:27 <Wanda[cis]> and from what I know, some vendor tools don't bother with the second part of that

23:09 chaoticryptidz has quit [Read error: Connection reset by peer]

23:11 chaoticryptidz has joined #amaranth-lang

23:25 lf has quit [Ping timeout: 258 seconds]

23:25 lf has joined #amaranth-lang