#scopehal on 2023-08-24 — irc logs at libera.irclog.whitequark.org

2022-03-25 21:41 azonenberg changed the topic of #scopehal to: libscopehal, libscopeprotocols, and glscopeclient development and testing | https://github.com/glscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal

01:34 Degi_ has joined #scopehal

01:35 Degi has quit [Ping timeout: 248 seconds]

01:35 Degi_ is now known as Degi

02:51 <_whitenotifier> [scopehal-apps] eyecan forked the repository - https://github.com/eyecan

04:29 Stary_ has joined #scopehal

04:29 Fridtjof_ has joined #scopehal

04:35 Fridtjof has quit [*.net *.split]

04:35 Stary has quit [*.net *.split]

05:11 <d1b2> <hansemro> Sorry for delays on finalizing PRs. I am nearly finished working on AVX optimizations for Siglent BIN Import. AVX2 optimization for processing digital samples is done, but could use some work/feedback.

05:12 <azonenberg> Oh yay the IRC-discord bridge is back up

05:12 <azonenberg> No worries, i have my hands full with other stuff

05:12 <azonenberg> I'm in the middle of doing some probe testing and working with some artists to improve the appearance of the filter graph editor

05:12 <d1b2> <hansemro> https://gist.github.com/hansemro/81f7cd5330c92e53b37c69829d78e4e4 for prototype AVX2 work for converting 1-bit digital samples to bool array

05:13 <azonenberg> How much of a speedup are you seeing?

05:13 <d1b2> <hansemro> 1.6x-2x

05:14 <d1b2> <hansemro> for large sample size

05:15 <d1b2> <hansemro> Needed to add two more vector instructions to cleanup the byte array. Not sure if this is a bool cast issue. https://gist.github.com/hansemro/81f7cd5330c92e53b37c69829d78e4e4#file-dsconvert-cpp-L120-L124

05:16 <azonenberg> I'd have to take a bit more time to look at that and find out. What i can say is, i'm impressed, i think you're the first contributor we've had doing vector optimization other than myself lol

05:17 <d1b2> <hansemro> scopehal dev branch with digital signal processing optimization: https://github.com/hansemro/scopehal/tree/siglent-bin-import-avx2

05:22 <d1b2> <hansemro> Learning AVX2 was quite interesting. Most of the work is in figuring out how to vectorize the process...

05:22 <azonenberg> Yep

05:22 <azonenberg> The other thing is, we are looking at moving some of that processing to GPU long term

05:22 <azonenberg> Since ultimately thats where we want to do as much data-parallel work as we can

05:22 <azonenberg> So far not a whole lot of filters have GPU implementations but i want to expand that

05:23 <d1b2> <hansemro> Seems interesting. I need to look more into GPU compute since it is the same SIMD concept

05:24 <azonenberg> Yeah. Look at the FIR filter shader if you want to get a quick overview of what it looks like

10:16 bvernoux has quit [Quit: Leaving]

16:59 <_whitenotifier> [scopehal] azonenberg opened issue #792: Eye pattern: add BER contour support - https://github.com/glscopeclient/scopehal/issues/792

16:59 <_whitenotifier> [scopehal] azonenberg labeled issue #792: Eye pattern: add BER contour support - https://github.com/glscopeclient/scopehal/issues/792

17:02 <_whitenotifier> [scopehal] azonenberg opened issue #793: Eye width/height measurements: allow specifying target BER for eye opening - https://github.com/glscopeclient/scopehal/issues/793

17:02 <_whitenotifier> [scopehal] azonenberg labeled issue #793: Eye width/height measurements: allow specifying target BER for eye opening - https://github.com/glscopeclient/scopehal/issues/793

17:15 <_whitenotifier> [scopehal-apps] azonenberg opened issue #603: Display measurement as a sink in the filter graph - https://github.com/glscopeclient/scopehal-apps/issues/603

17:15 <_whitenotifier> [scopehal-apps] azonenberg labeled issue #603: Display measurement as a sink in the filter graph - https://github.com/glscopeclient/scopehal-apps/issues/603

17:15 <_whitenotifier> [scopehal-apps] azonenberg labeled issue #604: Add tooltip to eye pattern showing instantaneous BER at cursor position - https://github.com/glscopeclient/scopehal-apps/issues/604

17:15 <_whitenotifier> [scopehal-apps] azonenberg opened issue #604: Add tooltip to eye pattern showing instantaneous BER at cursor position - https://github.com/glscopeclient/scopehal-apps/issues/604

19:27 <d1b2> <hansemro> @azonenberg Looking at QueueManager, I see that it is sorting queues in ascending order of feature flag count. This often means prioritizing queues with graphics capabilities. However, the first selected queue is not a graphics queue, but a transfer queue (see VulkanInit). On AMD integrated GPUs, where there is only 1 graphics capable queue, QueueManager will end up reusing the same graphics queue for rendering and g_vkTransferQueue. If

19:27 <d1b2> instead, we reversed the sort order (in descending order of feature flag count), this reuse does not happen and we can reserve graphics queue when they are needed for rendering.

19:29 <azonenberg> Hmm, that makes sense. In general we should try to use the least featureful queue that meets our needs for a given application

19:29 <azonenberg> lain: ^^

19:30 <azonenberg> Do some testing and send a PR

19:30 <azonenberg> in general we have done comparatively little testing on AMD cards since most of the devs have nvidia or apple silicon platforms

19:30 <azonenberg> Improved APU / unified memory card support is still pending

19:31 <azonenberg> in particular, AcceleratorBuffer does not currently understand that memory can be both host local and device local at the same time in a unified memory SoC

19:31 <azonenberg> so it will allocate two copies of each memory block and create needless copies

19:32 <azonenberg> This is being tracked as https://github.com/glscopeclient/scopehal/issues/681 and currently assigned to lain but she's not actively working on it

19:32 <azonenberg> So if you wanted to spend some time on it, it certainly wouldn't hurt

19:33 <azonenberg> Being a performance issue rather than "totally broken" and not affecting a platform anyone had easy access to for testing, it was lower on the priority list

19:34 <d1b2> <hansemro> Yes, this is pretty low priority. Not experiencing severe performance issues, but wanted to raise some awareness.

19:34 <azonenberg> File a ticket if nothing else

19:35 <azonenberg> Also, the CI stuff has been on hold for quite some time, i have a pair of GPUs sitting on the floor next to my rack that the VM serve ris in

19:35 <azonenberg> where they've been since like april

19:35 <azonenberg> i hope to have time to get back to that soon. things have been hectic and rebooting the vm server is a big annoyance but i'm about due for some hypervisor patches and distro updates on the VMs so a lot of stuff is going to get rebooted soon anyway

19:38 <d1b2> <hansemro> Unrelated: I am interested in picking up the jtaghal project. What are your bsdl parsing needs? I don't have too much experience in writing a lexer and parser, but I want to do some bsdl-IC validation tooling.

19:39 <azonenberg> Jtaghal had really been focused on in circuit debug and test of FPGA stuff, with a bit of ARM on the side

19:39 <d1b2> <hansemro> I see

19:39 <azonenberg> I dont think i've ever actually done actual boundary scan with INTEST/EXTEST

19:39 <azonenberg> So i never put any effort into it

19:40 <azonenberg> I was mostly using it for things like debug of FPGA based stuff using the xilinx USERx instructions, and researching low level ARM debug stuff to study code protection and security mechanisms for work

19:40 <azonenberg> While the project isn't dead, i'm not actively working on it because it does what I need it to at the moment

19:40 <azonenberg> And it never got anywhere near the level of community or adoption as scopehal did

19:41 <d1b2> <hansemro> gotcha

19:41 <azonenberg> That said, one of my mid term TODO items is reverse engineering the xilinx ILA and VIO IP core JTAG protocols

19:41 <azonenberg> and writing a jtaghal + scopehal based driver such that I can interface directly with ILA and VIO blocks a) without having to use vivado for debug and b) use ngscopeclient to read ILA data

19:42 <azonenberg> ultimately i want to be able to do complex cross-trigger setups with an external scope/LA plus an ILA (or several) and do trigger cascade, compare the on-chip view of a signal to the off-chip view and identify the electrical causes of bit errors, etc

19:43 <azonenberg> And poke bits in a VIO from ngscopeclient gui

19:43 <azonenberg> while viewing analog waveforms

19:44 <azonenberg> When fine tuning an FPGA transceiver, i frequently will poke emphasis taps and drive strength in a VIO then take eye measurements with a scope

19:44 <azonenberg> So having that all under one roof would be handy

19:45 <azonenberg> I also want to have a well defined way to access the built in eye scan feature on xilinx FPGAs via scopehal, so i can get a post-equalization BER eye

19:48 <azonenberg> hansemro: anyway, if you want to use jtaghal for your work i won't object to you figuring out a way to bolt in a BSDL parser, and will happily take a PR for it. but i don't consider it a priority at all

19:48 <d1b2> <hansemro> I guess this is also something unsupported by open source series 7 Xilinx FPGA flows? Do you know any projects that use BSCAN blocks directly?

19:49 <azonenberg> I do not know the state of the open flows, i've always used vivado

19:49 <azonenberg> For my thesis, I made heavy use of BSCANs for debug. in fact that was what i originally started jtaghal for since xilinx didn't have any API in ISE/vivado for doing this

19:49 <azonenberg> (this was in part because ChipScope was a paid feature for ISE at the time and vivado came out right before I graduated)

19:50 <azonenberg> i had my own logic analyzer core

19:50 <azonenberg> i also had a layer 2 tunneling mechanism allowing me to push raw frames from my custom NoC over JTAG and into the interconnect fabric on the FPGA

19:50 <azonenberg> on the PC side it was exposed as a TCP socket server

19:50 <azonenberg> you could connect to the server and get a connection object which directly mapped to a virtual bus endpoint on the FPGA

19:51 <azonenberg> and send and receive messages via JTAG as if you were a soft IP on the FPGA

19:51 <azonenberg> and exercise actual gateware at full hardware speed from C++ test cases

20:01 <azonenberg> It was actually barely even JTAG I was using at that point

20:01 <azonenberg> I loaded USER1 into IR, switched to SHIFT-DR state

20:01 <azonenberg> then free-ran TCK while pushing framed data or padding into TDI and getting framed data out TDO

20:01 <azonenberg> so i'd just send zeroes if i had nothing to say, then when i wanted to send a frame I'd send a 55 55 55 D5 preamble followed by the bus transaction lol

20:01 <azonenberg> and then a CRC at the end

20:02 <azonenberg> it was basically slightly abbreviated ethernet framing tunneled over barely-jtag

20:02 <d1b2> <hansemro> I am really amazed by your work. As a recent college grad, I have lots to learn and experience.

20:06 <d1b2> <hansemro> I agree with your sentiment that a device that cannot be tested/debugged is useless. I am trying to force myself to do more pre-design and post-design verification work since no one I know seems to enjoy it.

20:18 <azonenberg> yeah, meanwhile i like building tools

20:18 <azonenberg> almost more than i like building things with said tools lol

21:49 <d1b2> <hansemro> I correct my statement about the QueueManager. It looks like the intended sort is ascending (which is what we want), but the sort is actually descending. So this seems like a bug.

21:50 <d1b2> <hansemro> https://github.com/glscopeclient/scopehal/blob/3d9118656edf0dfee8ae0a5663781a33d4af7e61/scopehal/QueueManager.cpp#L147 says ascending order

22:25 Bird|ghosted has quit [Ping timeout: 250 seconds]

22:29 Bird|otherbox has joined #scopehal

23:01 <_whitenotifier> [scopehal] hansemro opened pull request #794: Correct ascending sort of Vulkan queues by feature flag count - https://github.com/glscopeclient/scopehal/pull/794

23:06 <_whitenotifier> [scopehal] hansemro synchronize pull request #794: Correct ascending sort of Vulkan queues by feature flag count - https://github.com/glscopeclient/scopehal/pull/794

23:06 <_whitenotifier> [scopehal] hansemro edited pull request #794: Correct ascending sort of Vulkan queues by feature flag count - https://github.com/glscopeclient/scopehal/pull/794

23:07 <_whitenotifier> [scopehal] hansemro edited pull request #794: Correct ascending sort of Vulkan queues by feature flag count - https://github.com/glscopeclient/scopehal/pull/794