#scopehal on 2023-07-14 — irc logs at libera.irclog.whitequark.org

2022-03-25 21:41 azonenberg changed the topic of #scopehal to: libscopehal, libscopeprotocols, and glscopeclient development and testing | https://github.com/glscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal

01:21 veegee has joined #scopehal

02:56 Degi_ has joined #scopehal

02:57 Degi has quit [Ping timeout: 252 seconds]

02:57 Degi_ is now known as Degi

07:02 bvernoux has joined #scopehal

11:19 <_whitenotifier-6> [scopehal-sigrok-bridge] perigoso commented on pull request #2: use https instead of ssh for submodules - https://github.com/glscopeclient/scopehal-sigrok-bridge/pull/2#issuecomment-1635715621

21:06 <d1b2> <johnsel> hey @azonenberg this question is not super related, but it has been quite silent here for a while and people might take an interest. You are working on a DIY switch, correct? Do you have some insight into how to start making use of the SFP+ on the KC705 I got? Any projects I should take a look at or general approach tips?

21:07 <d1b2> <johnsel> now I should specify that I want to do 10G

21:07 <azonenberg> johnsel: So basically, a SFP+ is just a differential pair to light converter

21:08 <azonenberg> it has no intelligence, just some thresholding and a few simple feedback loops for sensitivity and tx power level

21:08 <d1b2> <johnsel> yup, the question is how do I build up something inside the FPGA to talk to/over it 🙂

21:08 <azonenberg> There's a few 3.3V GPIOs for things like enabling/disabling the transmit, detecting that a module is present, detecting faults

21:08 <azonenberg> an optional i2c bus that contains a descriptor EEPROM and (usually, but not required by spec) some sensors

21:09 <azonenberg> The actual data is 10Gbase-R coded

21:09 <azonenberg> Which is to say, 64/66b coded ethernet frames

21:09 <azonenberg> I have an open source MAC/PCS in my antikernel-ipcores repo that integrates nicely with a 7 series GTX

21:11 <azonenberg> https://github.com/azonenberg/antikernel-ipcores/blob/master/interface/ethernet/XGEthernetPCS.sv?ts=4

21:12 <azonenberg> https://github.com/azonenberg/antikernel-ipcores/blob/master/interface/ethernet/XGEthernetPCS.sv?ts=4

21:12 <azonenberg> oops

21:12 <azonenberg> https://github.com/azonenberg/antikernel-ipcores/blob/master/interface/ethernet/XGEthernetMAC.sv?ts=4

21:13 <azonenberg> XGMACWrapper is just a shell around those two to save you the trouble of instantiating the two modules directly

21:13 <d1b2> <johnsel> that's very useful already

21:13 <azonenberg> What you end up with is, on the internal-facing side, a data bus consisting of 32 data bits, a 312.5 MHz clock, a valid flag, and a bytes-valid counter

21:13 <azonenberg> plus a start flag that is asserted during the preamble (so you can reset per-packet state machines)

21:14 <azonenberg> and then at the end of a packet either commit goes high, indicating good checksum and everything went fine

21:14 <azonenberg> or drop goes high, indicating the packet was corrupted/malformed and should be ignored

21:14 <azonenberg> TX is the same bus sans drop flag, once you start sending you have to finish sending it
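
[Editor's sketch] The RX-side bus described above (32 data bits, a valid flag, a bytes-valid counter, start, commit, drop) can be modeled in software roughly as follows. This is a minimal behavioral sketch, not the actual antikernel-ipcores interface: the field names, the `RxBeat` type, and the assumption that partial words are left-aligned with the first byte in the most significant position are all illustrative guesses.

```python
from dataclasses import dataclass
from typing import Iterable, List


@dataclass
class RxBeat:
    """One clock cycle on a hypothetical RX bus (all field names are assumptions)."""
    start: bool = False      # asserted during the preamble
    valid: bool = False      # data / bytes_valid are meaningful this cycle
    bytes_valid: int = 0     # how many of the 4 bytes in `data` are real (1..4)
    data: int = 0            # 32-bit word, first byte assumed in the top 8 bits
    commit: bool = False     # end of packet, checksum good
    drop: bool = False       # end of packet, corrupted or malformed


def collect_frames(beats: Iterable[RxBeat]) -> List[bytes]:
    """Reassemble committed frames and silently discard dropped ones."""
    frames: List[bytes] = []
    buf = bytearray()
    for beat in beats:
        if beat.start:
            buf.clear()                      # reset per-packet state
        if beat.valid:
            word = beat.data.to_bytes(4, "big")
            buf += word[:beat.bytes_valid]   # keep only the valid bytes
        if beat.commit:
            frames.append(bytes(buf))        # good packet: hand it on
            buf.clear()
        elif beat.drop:
            buf.clear()                      # bad packet: forget it
    return frames


example = [
    RxBeat(start=True),
    RxBeat(valid=True, bytes_valid=4, data=0x01020304),
    RxBeat(valid=True, bytes_valid=2, data=0x05060000),
    RxBeat(commit=True),
    RxBeat(start=True),
    RxBeat(valid=True, bytes_valid=4, data=0xDEADBEEF),
    RxBeat(drop=True),
]
frames = collect_frames(example)
```

The point of the commit/drop pair is that a consumer can start buffering a packet before the checksum is known and only decide at the end whether to keep it.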

21:14 <azonenberg> on the other side, it expects to talk to the 7 series transceiver wizard configured for 10Gbase-R with, iirc, the asynchronous 64/66b gearbox

21:15 <azonenberg> also note that my XGMIIBus interface is not 802.3 compliant XGMII

21:15 <azonenberg> i swapped the lane numbering left to right, so that bytes would show up in a human readable order in logic analyzer / simulation traces

21:16 <azonenberg> and it's also single rate 312.5 MHz vs DDR 156.25 MHz since nobody uses ddr signals inside an fpga
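
[Editor's sketch] The clock figures above can be checked with a little arithmetic: a 32-bit bus at 312.5 MHz single data rate carries the same 10 Gbit/s as a 32-bit bus at 156.25 MHz double data rate, and 64/66b coding raises the serial line rate to 10.3125 Gbaud. The helper names below are made up for illustration.

```python
from fractions import Fraction

DATA_RATE_BPS = 10_000_000_000  # 10 Gbit/s of payload-level data


def bus_throughput(width_bits: int, clock_hz: int, edges_per_cycle: int = 1) -> int:
    """Bits per second moved by a parallel bus."""
    return width_bits * clock_hz * edges_per_cycle


def line_rate_64b66b(data_rate_bps: int) -> Fraction:
    """Serial line rate after 64/66b coding (66 line bits per 64 data bits)."""
    return data_rate_bps * Fraction(66, 64)


sdr_bus = bus_throughput(32, 312_500_000)                      # single rate, as in the chat
ddr_bus = bus_throughput(32, 156_250_000, edges_per_cycle=2)   # DDR variant
line_rate = line_rate_64b66b(DATA_RATE_BPS)

print(sdr_bus, ddr_bus, int(line_rate))
```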

21:22 <d1b2> <johnsel> Thanks, that's super useful already. I haven't looked very carefully, but it looked like you have some IPv4 packet related things written already, correct?

21:23 <azonenberg> I have a full IPv4, ICMP, ARP, and UDP stack

21:24 <azonenberg> It's intended as an embedded server, so it lacks client support for most of these protocols

21:24 <azonenberg> e.g. it can respond to incoming pings, but not initiate an echo request

21:24 <azonenberg> it also has a TCP server that is a WIP, it works great as long as you never drop a packet from the FPGA to the client

21:25 <azonenberg> it will correctly send ACKs and everything else so client-to-FPGA packet loss is well tolerated

21:25 <azonenberg> but it doesn't retransmit anything sent in the opposite direction

21:25 <d1b2> <johnsel> hmmm, do you have something you use to benchmark it?

21:25 <azonenberg> Not currently. I'm not actually using the stack for anything serious yet

21:25 <azonenberg> what i've actually used more seriously is the software tcp/ip stack, azonenberg/staticnet

21:25 <azonenberg> which is basically the same level of completion

21:26 <azonenberg> no tcp retransmits, no client support, no ipv6

21:26 <azonenberg> the difference is, this one has a ssh server implementation attached to it

21:26 <azonenberg> it's super bare bones and has no OS or library dependencies, in particular it explicitly does not use dynamic memory allocation

21:26 <d1b2> <johnsel> I see, not on a microblaze or other cpu core inside a FPGA I assume right?

21:27 <azonenberg> everything is based on fixed sized packet pools that are statically allocated
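
[Editor's sketch] A fixed-size packet pool of the kind mentioned above can be illustrated as follows. This is not staticnet's API. The class and method names are hypothetical, and Python can only emulate static allocation by creating every buffer once up front and never allocating packet storage afterwards.

```python
from typing import List, Optional


class PacketPool:
    """Hypothetical fixed-size pool: all buffers are created once, at construction."""

    def __init__(self, count: int, size: int) -> None:
        self._buffers: List[bytearray] = [bytearray(size) for _ in range(count)]
        self._free: List[int] = list(range(count))  # handles of unused buffers

    def alloc(self) -> Optional[int]:
        """Return a buffer handle, or None when the pool is exhausted."""
        return self._free.pop() if self._free else None

    def buffer(self, handle: int) -> bytearray:
        """Access the storage behind a handle."""
        return self._buffers[handle]

    def free(self, handle: int) -> None:
        """Return a buffer to the pool."""
        if handle in self._free:
            raise ValueError("double free")
        self._free.append(handle)


pool = PacketPool(count=2, size=1518)
a = pool.alloc()
b = pool.alloc()
exhausted = pool.alloc()   # pool is empty now, so this is None
pool.free(a)
reused = pool.alloc()      # the freed buffer is handed out again
```

Exhaustion is an ordinary return value rather than an out-of-memory failure, so the caller can decide to drop a packet instead.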

21:27 <azonenberg> It could hypothetically run on such

21:27 <azonenberg> but the intended use case is stm32h7

21:27 <azonenberg> i have a driver for the stm32h7 crypto accelerator to speed up SSH already, although it doesn't have elliptic curve functionality

21:27 <azonenberg> so i either do that in software or (in progress) integrate with an fpga curve25519 accelerator

21:28 <azonenberg> The intent for the all-FPGA stack is to be used on the open hardware scopes, since there's no way the stm32 tcp/ip stack can get remotely close to saturating a 10G link with packet data

21:28 <azonenberg> What i am beginning to explore is linking them

21:28 <azonenberg> so that things like arp, icmp, etc are handled on the MCU

21:28 <azonenberg> and low bandwidth management traffic like scpi goes to it

21:28 <azonenberg> but high speed stuff like the waveform sample datapath is all FPGA

21:29 <azonenberg> rather than having the waveform data and the management be considered two separate hosts with their own ip/mac i want to look into sharing state and packet data

21:29 <azonenberg> such that certain ports/protocols are implemented in software and others in hardware

21:29 <azonenberg> and you can trade back and forth depending on fpga area vs performance requirements
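
[Editor's sketch] The split described above, with ARP, ICMP and low-bandwidth management handled in software and the high-speed datapath in hardware, amounts to a classification step on each incoming frame. The sketch below is one way to picture it. The port numbers, the `Target` enum, and the choice to drop anything unrecognized are assumptions, not details from the chat.

```python
from enum import Enum
from typing import Optional


class Target(Enum):
    MCU = "mcu"     # software stack on the microcontroller
    FPGA = "fpga"   # hardware datapath
    DROP = "drop"   # assumption: ignore anything unrecognized


ETHERTYPE_IPV4 = 0x0800
ETHERTYPE_ARP = 0x0806
IPPROTO_ICMP = 1
IPPROTO_TCP = 6
IPPROTO_UDP = 17

SCPI_TCP_PORT = 5025        # hypothetical management port
WAVEFORM_UDP_PORT = 50101   # hypothetical high-speed sample port


def route(ethertype: int, ip_proto: Optional[int] = None,
          dst_port: Optional[int] = None) -> Target:
    """Decide which side of the MCU/FPGA split handles a frame."""
    if ethertype == ETHERTYPE_ARP:
        return Target.MCU
    if ethertype != ETHERTYPE_IPV4:
        return Target.DROP
    if ip_proto == IPPROTO_ICMP:
        return Target.MCU
    if (ip_proto, dst_port) == (IPPROTO_TCP, SCPI_TCP_PORT):
        return Target.MCU
    if (ip_proto, dst_port) == (IPPROTO_UDP, WAVEFORM_UDP_PORT):
        return Target.FPGA
    return Target.DROP
```

Moving a protocol between software and hardware then means changing one rule here and implementing the handler on the other side.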

21:29 <d1b2> <johnsel> yeah you've told me about it before, it's an interesting idea

21:30 <azonenberg> anyway the reasons for using the external mcu are that it has a lot of sram (so doesnt compete with fpga block ram)

21:30 <azonenberg> it has a random number generator (so no need to use sketchy RNGs in the FPGA for crypto)

21:30 <azonenberg> and it can clock significantly faster than a typical softcore

21:31 <d1b2> <johnsel> You're basically doing Zynq but discrete now, haha

21:31 <azonenberg> Yes

21:31 <azonenberg> and with a cortex-M not an A

21:31 <d1b2> <johnsel> Anyway I'm looking into 10G for my scope project, so if I do build something useful I'll PR it back

21:31 <azonenberg> i like bare metal not linux

21:31 <azonenberg> And with the FPGA and MCU being explicitly decoupled

21:31 <azonenberg> e.g. the mcu cannot reprogram the FPGA unless you create an interface for it to do so

21:32 <azonenberg> one of the things i liked about the stm32h735 is that one of the package options (which i have not got my hands on yet, it's out of stock everywhere i looked) is a 68 pin QFN

21:32 <azonenberg> i could basically just have jtag, uart, quad SPI to the FPGA, and maybe a few debug LEDs

21:32 <azonenberg> and have it be a "brain on a stick" hanging off the FPGA

21:32 <azonenberg> xilinx's vision for zynq is an arm soc with an fpga accelerator as a peripheral

21:33 <azonenberg> my vision is an fpga with a microcontroller as a peripheral :p

21:33 <d1b2> <johnsel> Yeah different usecases

21:33 <d1b2> <johnsel> I get the industry move towards linux, it gets software people into hardware more easily, but there's definitely a lot of downsides to their current approach

21:34 <azonenberg> yeah. and the over-reliance on things like axi and linux makes it difficult to use any other way

21:34 <azonenberg> like you basically *have* to use the ip integrator in a zynq design

21:35 <d1b2> <johnsel> yeah that's the whole spiel, you get custom hardware in your SoC that you can drive from the fully featured Linux environment

21:35 <azonenberg> Yeah

21:35 <azonenberg> thats one of the things that bothers me about xilinx's future

21:35 <azonenberg> all of their marketing docs are presenting versal as the successor to ultrascale+

21:35 <azonenberg> they dont go out and say it, but it's strongly implied

21:36 <d1b2> <johnsel> they wouldn't, would they?

21:36 <d1b2> <johnsel> I think discrete FPGA will stay

21:36 <azonenberg> i.e. i fear that au+ / ku+ may be their last family of fpgas without an arm core you are forced to use to get any work done at all

21:36 <d1b2> <johnsel> it's just the AI craze taking hold

21:36 <azonenberg> I think it will stay across the industry

21:36 <azonenberg> I don't know if it will stay *from xilinx*

21:36 <azonenberg> they seem all-in on versal and i dont like it

21:36 <d1b2> <johnsel> that would be the stupidest thing ever

21:37 <azonenberg> anyway, u+ isn't going away any time soon, even 7 series is going to be supported until at least like 2035 iirc

21:37 <azonenberg> So even if there's no next-gen platform afterwards, i have a long ways to go before my projects outgrow a ku5p :p

21:37 <azonenberg> Considering right now i'm working on a 7k160t and using a nontrivial amount of it, but nowhere near running out of space (yet)

21:39 <d1b2> <johnsel> Yeah for sure, I'm discussing building an overpowered "Analog Discovery" with someone and he asked for Xilinx' latest series (as it would be good for marketing). I said their 7 series are still plenty fast enough for what we want to do.

21:40 <d1b2> <johnsel> it's a tough job to fully utilize one of those chips, especially on Kintex Serdes

21:40 <d1b2> <johnsel> and 12.8Gbit/s is plenty fast, especially if you have like 8 or 16 of them

21:43 <azonenberg> i mean, i have the opposite problem with ethernet lol

21:44 <azonenberg> LATENTORANGE is going to use as many serdes as i can find for switching N 10GbE lanes

21:44 <azonenberg> and then for the open scope project, i'll need a dozen JESD204B lanes to use the AD9213

21:44 <azonenberg> That's going to be my next big hardware project once i have the mini-switch done i think

21:45 <azonenberg> although it will be a multi step project, i need to do more work on the frontend (might borrow ideas from the thunderscope but i have my own frontend design i wanted to play more with too)

21:48 <d1b2> <johnsel> anyway to recap, using your stack: set up the 7 series transceiver wizard configured for 10Gbase-R with the asynchronous 64/66b gearbox; set up the interface and protocols using your stack; probably tinker with the SFP+ module to get it to actually switch on, and maybe sort out some clocking issues (KC705 has a weird clock for SFP+, not sure if you use that or pull a clock from somewhere else); and hope for some wireshark traffic

21:48 <azonenberg> Pretty much. There is a full TCPIPStack module that integrates all of the various protocol components if you want to use that for starting out

21:48 <d1b2> <johnsel> sound correct to you?

21:49 <azonenberg> you just have to instantiate the serdes wizard, the mac/pcs, and the stack and bolt them together

21:49 <d1b2> <johnsel> Cool. I'll let you know how far I get, I'm receiving the SFP+ PCIe module tomorrow and some transceivers and fiber

21:50 <azonenberg> I also have a 1000base-X core as well BTW

21:50 <azonenberg> which you can use with a GTX or GTP in 8b10b mode

21:51 <d1b2> <johnsel> Might be a good one to keep in the debugging toolkit if nothing goes as it should

21:52 <azonenberg> and then i have GMII and RGMII support of course

21:52 <azonenberg> and experimental SGMII. The 1000base-X block should support SGMII over a GTP no problem today (although this has never been tested)

21:52 <azonenberg> and it also should in theory work over ISERDES/OSERDES oversampling, but i had hardware problems on my last board that used it

21:53 <azonenberg> and the switch has two SGMII PHYs that i plan to use to continue testing this

21:53 <azonenberg> I also have QSGMII support using a GTP/GTX, which is broken out to four SGMII lanes with their own MACs. this is very lightly simulation tested but has never been tested in hardware

21:53 <azonenberg> hopefully that will begin this weekend once i stuff the other side of this board

21:54 <d1b2> <johnsel> cool, I've been seeing incremental progress on your mastodon

21:57 <azonenberg> Yep. All of these projects tie into each other

21:57 <azonenberg> the whole reason scopehal has so many networking protocol decodes is so that i can do debug and verification on the switch

21:58 <azonenberg> and i got into high speed networking so i could better build infrastructure to run high performance data acquisition

21:59 <azonenberg> and i got into high speed probing so i could collect waveforms to debug both of the above

21:59 <azonenberg> lol

22:02 <d1b2> <johnsel> recursive improvement

22:03 <d1b2> <johnsel> same-ish story here though. I wanted to do something high-speed. But to do high-speed you need an oscilloscope, thus i'm building a high-speed oscilloscope. Now I am working on the oscilloscope I have need for faster interfaces so I am looking at 10GBase

22:04 <d1b2> <johnsel> Although I was starting from 0, you started with some nice measurement capability already. But I like bootstrapping projects

22:08 <azonenberg> i mean that was the inspiration for FREESAMPLE

22:09 <azonenberg> Which i still want to build at some point

22:09 <azonenberg> (open hardware 10 GHz sampling scope)