azonenberg changed the topic of #scopehal to: libscopehal, libscopeprotocols, and glscopeclient development and testing | https://github.com/glscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal
Degi has quit [Ping timeout: 272 seconds]
Degi has joined #scopehal
<d1b2> <louis> Still screwin' with it, but here's a sneak peek
<d1b2> <louis> May change it to instead graph all selected on the same graph widget
<azonenberg> Nice
<azonenberg> btw, not sure if you've looked at liblogtools in detail yet
<azonenberg> but if you wanted to make a log sink that would push bridge console output into the debug stream on the socket
<azonenberg> that is totally doable
<azonenberg> logtools supports arbitrarily many sinks that messages can be sent to
<azonenberg> the only two currently implemented are console and files
<azonenberg> but that is in no way an architectural limitation
<azonenberg> just never had a reason to build others
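The multi-sink fan-out described above can be sketched as follows. Class and method names here are illustrative stand-ins, not liblogtools' actual C++ API; the point is that a socket-backed sink slots in beside the console sink with no architectural change.

```python
# Hypothetical sketch of logtools' multi-sink model (names are illustrative,
# not the real liblogtools API): every sink receives every message, so a
# socket-backed sink can mirror bridge console output into a debug stream.
class LogSink:
    def log(self, msg: str) -> None:
        raise NotImplementedError

class ConsoleSink(LogSink):
    def __init__(self):
        self.lines = []
    def log(self, msg):
        self.lines.append(msg)          # stand-in for printing to the console

class SocketSink(LogSink):
    def __init__(self, send):
        self.send = send                # e.g. sock.sendall on the debug socket
    def log(self, msg):
        self.send(msg.encode() + b"\n")

sent = []
sinks = [ConsoleSink(), SocketSink(sent.append)]
for s in sinks:                         # messages fan out to every sink
    s.log("bridge console line")
```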
<azonenberg> @louis: also rather than a "G" button maybe have a column titled "graph" that contains a Gtk::Switch?
<azonenberg> or a Gtk::CheckButton
<azonenberg> might look better that way
massi has joined #scopehal
nelgau_ has quit [Ping timeout: 250 seconds]
nelgau has joined #scopehal
massi has quit [Quit: Leaving]
bvernoux has joined #scopehal
azonenberg has quit [Ping timeout: 240 seconds]
azonenberg has joined #scopehal
<d1b2> <Aleksorsist> Quick question, does glscopeclient do persistence with the triggered data it gets in, or does it somehow get the persistence data from the scope?
<azonenberg> aleksorsist: persistence is all implemented in postprocessing based on stacking multiple acquisitions
<azonenberg> there is currently no support for downloading hardware based persistence from the scope
<azonenberg> at least Pico has this capability in hardware
<azonenberg> unsure about other vendors
<azonenberg> but we don't implement it yet
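The stacking approach described above can be modeled in a few lines: each acquisition increments per-pixel hit counts, so repeated traces brighten while a rare glitch still leaves a faint trail. The quantization scheme here is a simplified assumption, not glscopeclient's actual rendering path.

```python
# Minimal sketch of software persistence: stack multiple acquisitions into a
# 2D intensity map. Bin hit counts stand in for pixel brightness.
def accumulate(persistence, waveform, n_levels):
    """persistence: dict[(sample_index, level_bin)] -> hit count.
    Waveform values are assumed normalized to the 0..1 range."""
    for x, v in enumerate(waveform):
        y = max(0, min(n_levels - 1, int(v * n_levels)))  # quantize the level
        persistence[(x, y)] = persistence.get((x, y), 0) + 1
    return persistence

hits = {}
for _ in range(10):
    accumulate(hits, [0.5, 0.5, 0.5], n_levels=8)   # repeated nominal trace
accumulate(hits, [0.5, 0.9, 0.5], n_levels=8)       # one rare glitch
```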
<d1b2> <Aleksorsist> Perfect, just got to get the triggering then
<azonenberg> Long term i would like to support persistence in hardware to get faster update rates and detection of rare glitches etc
<azonenberg> But the alternative approach is of course just to try to capture waveforms at faster rates and stream to the host
<azonenberg> which is basically what you're going for
<azonenberg> as of now we max out at one waveform per GPU frame but i have plans to improve the rendering path to handle frames arriving faster than they can be displayed
<azonenberg> such that you get persistence within single GPU frames
<azonenberg> your hardware, the dslogic stuff, and i think possibly the picoscope 6000e family are the only scopes in the pipeline so far where this might be a problem
<azonenberg> nothing else is this fast :p
<azonenberg> (also i will continue to recommend that you push triggering to fpga vs trying to stream it all over the thunderbolt interface)
<d1b2> <Aleksorsist> That is the obvious (and probably best) solution! Software triggering has proven feasible though and I really want to see what's possible with it
<d1b2> <Aleksorsist> Worst case I switch my forever backordered a35t to an a50t and do triggering in that
<azonenberg> i mean simple edge triggering should be doable in hardware on a 35t
<azonenberg> just a question of capture buffer for pretrigger etc
<d1b2> <Aleksorsist> Our 35t is 90% utilized 😬
<azonenberg> oh fun lol
<azonenberg> yeah that makes it trickier
<d1b2> <Aleksorsist> Hard to get both 32 bit wide ddr3l and PCIe dma in
<d1b2> <Aleksorsist> Had to tweak some non-gui options in the xdma core just to get it to fit lol
<d1b2> <Aleksorsist> The Datamover IP is bloated and buggy so maybe writing a custom one could free the space needed
<d1b2> <Aleksorsist> That plus reconfiguring the FPGA when changing triggers could do it. Though PCIe would not like that one bit
<azonenberg> So, funny you mention that
<azonenberg> i had been talking with some other folks about "poor man's partial reconfig" for triggers
<azonenberg> essentially building a state machine containing some SRLC32E's
<azonenberg> which you serially load with truth tables
<azonenberg> and then use as LUT5s
<azonenberg> i've done this in the past for other purposes
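The SRLC32E trick above amounts to serializing a 5-input truth table into a 32-bit shift register, then reading it back as a LUT5. This behavioral model illustrates the idea; the bit ordering (which address the first shifted-in bit lands on) is an assumption here, so check the Xilinx SRLC32E documentation for the real convention.

```python
# Sketch of "poor man's partial reconfig": serialize a 5-input truth table
# into the 32 bits you'd shift into an SRLC32E, then use it as a LUT5.
def truth_table_bits(f):
    """Return the 32-entry truth table of f(a, b, c, d, e) as a list of 0/1."""
    bits = []
    for addr in range(32):
        ins = [(addr >> i) & 1 for i in range(5)]   # input a = address LSB
        bits.append(1 if f(*ins) else 0)
    return bits

class ShiftRegLUT5:
    """Behavioral model: shift in 32 bits, then index by a 5-bit address."""
    def __init__(self):
        self.q = [0] * 32
    def shift(self, bit):
        self.q = [bit] + self.q[:-1]    # assumed: last bit shifted in sits at Q0
    def lut5(self, addr):
        return self.q[addr]

lut = ShiftRegLUT5()
# shift MSB-first so that, under the assumed ordering, q[addr] == table[addr]
table = truth_table_bits(lambda a, b, c, d, e: a & b | e)
for bit in reversed(table):
    lut.shift(bit)
```

Reloading a different trigger function is then just 32 clock cycles of shifting, with no bitstream reconfiguration and no disruption to the PCIe link.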
<GyrosGeier> I've written very lean PCIe DMA blocks in VHDL that sit on top of the TLP streaming interface
<GyrosGeier> omitting things like transfers that are not 8 byte aligned saves a lot
<GyrosGeier> also, the streaming interface is asynchronous for the most part
<GyrosGeier> you send a TLP and get a reply at some later point
<GyrosGeier> (or in the case of DMA writes, you send TLPs)
<d1b2> <Aleksorsist> Both of these are very interesting
<d1b2> <Aleksorsist> For the reconfigurable triggering I assume it's a programmable sequence of rising or falling edges
<d1b2> <Aleksorsist> As for PCIe, streaming could be very promising, as it may eliminate the ddr3 buffer needed by the memory mapped xdma core
<azonenberg> and well, my plan was actually to create a state machine and a bunch of blocks that could detect various things
<azonenberg> for example, a clocked deserializer
<azonenberg> or an edge detector
<azonenberg> or a window check that looks for levels inside/outside some range
<azonenberg> etc
<azonenberg> then you'd be able to trigger on an arbitrary sequence of them, possibly with specific timing
<GyrosGeier> Aleksorsist, it might also be worthwhile to run the DDR interface in a kind of streaming mode
<d1b2> <Aleksorsist> @GyrosGeier That was something I considered, but it's only worth it if the PCIe core is streaming as well. And the xdma core streaming implementation is awful, running at around 50MB/s instead of >1GB/s
<d1b2> <Aleksorsist> @azonenberg A kind of triggering toolbox of things you can chain together arbitrarily, that would be really cool
<GyrosGeier> the PCIe streaming needs to insert 128 bits of TLP headers every 4kB at least
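The overhead of those headers is small, as a quick calculation shows. This uses the 4 kB payload figure from the discussion; note that many links negotiate a smaller max payload size, which raises the overhead proportionally.

```python
# Back-of-the-envelope check of the header-overhead claim: a 4DW TLP header
# is 128 bits (16 bytes), so with a 4 kB payload per TLP the framing
# overhead on a pure streaming link is well under 1%.
HEADER_BYTES = 16          # 4DW header = 128 bits
PAYLOAD_BYTES = 4096       # assumed payload per TLP, per the discussion

overhead = HEADER_BYTES / (HEADER_BYTES + PAYLOAD_BYTES)
efficiency = 1 - overhead
```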
<azonenberg> Yeah. i have some notes i threw together
<azonenberg> but the plan was to try and make this some kind of open standard eventually
<_florent_> Aleksorsist: PCIe does not require that much buffering, just enough to absorb the temporary PCIe unavailability (which can vary a bit depending on the machine). I've always managed to avoid buffering in DRAM on 7-Series FPGAs when doing continuous TX/RX streaming at up to 13/14Gbps in each direction for a Gen2 X4 link.
<azonenberg> _florent_: oh hey, while you're here
<_florent_> Aleksorsist: What's your PCIe configuration?
<azonenberg> I would like to add litescope support to glscopeclient at some point in the indefinite near future
<azonenberg> if you're interested in helping out with that
<_florent_> azonenberg: yes sure, but in fact LiteScope is very simple and also a bit limited (since just storing samples in BRAM)
<azonenberg> of course. but that is the point
<azonenberg> i want to support cross triggering of ILAs and external scopes
<azonenberg> in order to do combined debug of on and off chip components
<_florent_> azonenberg: but that could still be interesting to simplify things and use the internal decoders of GLScopeClient
<azonenberg> trigger on a checksum failure in the FPGA
<azonenberg> then jump to the analog PHY capture of the signal and investigate what corrupted the packet
<azonenberg> etc
<tnt> possible complication of litescope is that it has a bunch of different transports if you want to implement those natively in glscopeclient.
<azonenberg> tnt: i'm thinking the bridge approach probably makes the most sense
<azonenberg> have N different backends for the bridge and one socket api to scopehal
<azonenberg> that's what we've traditionally done for things that did not have a native scpi interface
<tnt> ok so you can just re-use the litex_server thing.
<azonenberg> the other reason being, we do not want to require you to link libscopehal against dozens of vendor API blobs
<azonenberg> when you are only going to use one or two of those instruments
<azonenberg> we want one portable binary and then you install the bridges required by your hardware if it lacks a native usbtmc, rs232, or ethernet interface
<d1b2> <Aleksorsist> @florent how much BRAM did you need for the buffering? My experiments showed that 512KB was needed, any less would drop samples due to latency caused by the OS. My configuration is a xilinx XDMA core in memory mapped mode reading from ddr3, which is written to by a xilinx datamover block.
<d1b2> <Aleksorsist> @florent (this would require the largest artix part, so it was more worthwhile to use ddr3)
<_florent_> azonenberg: to simplify my bridging LiteX <-> GLScopeClient, I started modifying one of your existing drivers to create a generic SCPI driver that I can easily control from Python scripts: https://github.com/enjoy-digital/xtrx_julia/tree/master/test/scopehal
<_florent_> azonenberg: for now it's only supporting analog, but I was planning to add digital support
<_florent_> azonenberg: that I could then use for LiteScope and would allow me to simply reuse the LiteX infrastructure for the control/capture
<_florent_> azonenberg: with this driver, I can use very simple python scripts to send data to GLScopeClient: https://github.com/enjoy-digital/xtrx_julia/blob/master/test/test_glscopeclient.py
<azonenberg> _florent_: Yeah my main interest is on chip debug integrated with external instruments
<azonenberg> and cross triggering
<azonenberg> i want to reverse engineer the jtag protocol for the xilinx ila as well
<azonenberg> so i can integrate with their ip
<_florent_> Aleksorsist: For Gen2 X4, 13-14Gbps of continuous streaming, I'm using maximum 128KB of buffering for each direction (but 128KB was only required on some specific machines, 64KB is enough on most)
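Those buffer sizes correspond directly to how long a temporary PCIe stall the FPGA can ride out, which a quick calculation makes concrete (figures match the numbers above: ~14 Gbps with 64-128 KB of buffering).

```python
# Back-of-the-envelope: how long a PCIe stall a given buffer can absorb
# at a given continuous streaming rate.
def stall_tolerance_us(buffer_bytes, rate_gbps):
    bytes_per_sec = rate_gbps * 1e9 / 8
    return buffer_bytes / bytes_per_sec * 1e6

small = stall_tolerance_us(64 * 1024, 14)    # 64 KB buffer: ~37 us of stall
large = stall_tolerance_us(128 * 1024, 14)   # 128 KB buffer: ~75 us of stall
```

So the OS-induced latencies that forced a 512KB buffer on the XDMA path would need to exceed roughly 75 microseconds before a 128 KB buffer drops samples at this rate.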
<d1b2> <Aleksorsist> Florent: what IP is this on for PCIe?
<azonenberg> litepcie i assume
<_florent_> yes
<d1b2> <Aleksorsist> Impressive! I never gave it a try since there was no windows driver
<_florent_> Aleksorsist: indeed, I was going to ask you the question (if you wanted Windows support)
<d1b2> <Aleksorsist> Has anyone built a windows driver for it?
<_florent_> there's only a Linux driver for now
<d1b2> <Aleksorsist> Florent: Ah, that may be an issue. We're aiming for Linux and Windows. I don't know what getting a windows driver done would entail, but I imagine it would be painful
<d1b2> <Aleksorsist> Florent: which sucks since it sounds like I could ditch the ddr3 and free up loads of FPGA space by using litepcie
<_florent_> Aleksorsist: Sorry yes, a Windows driver is not planned and I'm not able to estimate the effort. But in case you want to test LitePCIe on Linux, happy to prepare a design for your hardware.
<azonenberg> @aleksorsist: might be worth having a chat with @mp about that eventually
<d1b2> <Aleksorsist> True, it would save me a lot of hassle to use LitePCIe and I could definitely look at commissioning a Windows driver once our campaign is over.
<azonenberg> anyway, you can always decide later, up to the point that you actually put ram in the sockets
<azonenberg> or solder it on
<azonenberg> (i forget if you were going sodimm or chip-down)
<azonenberg> or even later, you can just not use it
<d1b2> <Aleksorsist> Florent: I'd really appreciate a demo if you have time!
<d1b2> <Aleksorsist> Azonenberg: true, choice to pop/depop can be zero hour lol
<d1b2> <Aleksorsist> Azonenberg: we are chip down
<_florent_> Aleksorsist: can you share info about your hardware? At minimum I need the PCIe pinout to prepare the design, the design could run from the PCIe clock if you don't have another clock for the system.
<d1b2> <Aleksorsist> Florent: we actually run from the PCIe clock already. Everything is open source under the ThunderScope repo and I can grab you the xdc pinout file if that will help too
<d1b2> <Aleksorsist> Florent: Those were for some testing done while bringing up the old module, these are the latest: https://github.com/EEVengers/ThunderScope/blob/FW/Aleksa/dso_top_fpga_module/Firmware/Artix7_PCIe/dso_top/dso_top.srcs/constrs_1/imports/new/module_io.xdc
<d1b2> <Aleksorsist> Florent: Sorry for the messy organization, the repo needs some clean-up which I've had no time for 😅
<_florent_> Aleksorsist: Thanks. I'll prepare this tomorrow then. I'll pre-generate the FPGA bitstream to save you from installing LiteX on your side :)
<d1b2> <Aleksorsist> Florent: Much appreciated :) Also realized that if this works I'll be well within the power budget and won't have to do something very janky to draw more amps (and with less power comes less chunky heatsink over FPGA). I am very excited 😄
<d1b2> <Aleksorsist> Now if only Windows didn't exist...
<d1b2> <Aleksorsist> One can dream of a better world that would be
<azonenberg> lol
<_florent_> Aleksorsist: :) I could also be interested in a Windows driver for LitePCIe and could probably find some funds to help developing it.
<azonenberg> Awesome
bvernoux has quit [Quit: Leaving]
<_florent_> Aleksorsist: just for info, here is a board designed for a client:
<_florent_> and the diagram:
<_florent_> it's able to do 3G-SDI TX/RX on 4 channels without buffering in DDR
<_florent_> so equivalent of >300MS/s on each channel in your case, so I'm really confident you can avoid the DDR
<_florent_> It's late here, going to bed. I'll prepare and share the PCIe design tomorrow.
<azonenberg> yeah streaming is a whole different ballgame. my problem in most of my projects is doing buffered/triggered capture of much higher volumes of data
<azonenberg> (e.g. 80 Gbps of logic analyzer samples)
<d1b2> <Aleksorsist> Florent: Sweet looking board! Thanks again for the help 🙂
<d1b2> <Aleksorsist> Florent: And working together to make a windows driver sounds like an excellent idea
<_whitenotifier-e> [scopehal-apps] 602p opened pull request #417: Enhancements to AFG support (re:Tek AFG) - https://github.com/glscopeclient/scopehal-apps/pull/417
<_whitenotifier-e> [scopehal] 602p opened pull request #597: Support Tek MSO56 AFG - https://github.com/glscopeclient/scopehal/pull/597
<_whitenotifier-e> [scopehal] 602p commented on pull request #597: Support Tek MSO56 AFG - https://github.com/glscopeclient/scopehal/pull/597#issuecomment-1110350624
<_whitenotifier-e> [scopehal-apps] 602p synchronize pull request #417: Enhancements to AFG support (re:Tek AFG) - https://github.com/glscopeclient/scopehal-apps/pull/417