#scopehal on 2024-01-31 — irc logs at libera.irclog.whitequark.org

2023-10-21 05:40 azonenberg changed the topic of #scopehal to: ngscopeclient, libscopehal, and libscopeprotocols development and testing | https://github.com/ngscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal

00:01 <bvernoux> yes

00:01 <bvernoux> it was always crashing

00:01 <d1b2> <david.rysk> I will investigate here

00:01 <d1b2> <david.rysk> but yeah I'm rewriting all the CMake scripts and writing CI and all

00:01 <bvernoux> it was random depending on build but now it always crashes because of ffts

00:02 <bvernoux> I have pushed the built files https://hydrabus.com/ngscopeclient/ngcopeclient_release_31Jan2024.7z

00:03 <bvernoux> It contains all dependencies dll

00:10 bvernoux has quit [Quit: Leaving]

00:13 <d1b2> <david.rysk> @bvernoux any reason you're not using the ffts from the repos?

01:10 <d1b2> <david.rysk> anyway it crashes for me, I will investigate

01:34 Degi_ has joined #scopehal

01:34 Degi has quit [Ping timeout: 255 seconds]

01:34 Degi_ is now known as Degi

02:21 <d1b2> <david.rysk> @bvernoux found the bug (not at all related to ffts), preparing a PR

02:25 <_whitenotifier-3> [scopehal] d235j opened pull request #849: Fix next_pow2 for ILP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

02:27 <_whitenotifier-3> [scopehal] d235j synchronize pull request #849: Fix next_pow2 for ILP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

02:33 <d1b2> <david.rysk> @bvernoux and there's the PR

02:39 <_whitenotifier-c> [scopehal] d235j edited pull request #849: Fix next_pow2 for ILP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

02:51 <_whitenotifier-3> [scopehal] d235j edited pull request #849: Fix next_pow2 for LLP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

02:52 <_whitenotifier-3> [scopehal] d235j synchronize pull request #849: Fix next_pow2 for LLP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

03:14 <_whitenotifier-c> [scopehal] azonenberg closed pull request #849: Fix next_pow2 for LLP64 (Windows) - https://github.com/ngscopeclient/scopehal/pull/849

03:14 <_whitenotifier-3> [scopehal] azonenberg pushed 2 commits to master [+0/-0/±2] https://github.com/ngscopeclient/scopehal/compare/00f827f37332...c2b1c25359cc

03:14 <_whitenotifier-c> [scopehal] azonenberg c2b1c25 - Merge pull request #849 from d235j/fix-windows-next_pow2 Fix next_pow2 for LLP64 (Windows)
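
For context on the LLP64 fix merged above: on 64-bit Windows (LLP64) `long` is 32 bits wide, while on 64-bit Linux and macOS (LP64) it is 64 bits, so a power-of-two helper written around `long` can silently truncate on Windows. The block below is a minimal sketch of a width-independent version, not the code from PR #849; the function name `next_pow2_u64` is hypothetical.

```cpp
#include <cassert>
#include <cstdint>

// Hypothetical illustration only; this is NOT the code from PR #849.
// Rounds v up to the next power of two using fixed-width 64-bit arithmetic,
// so the result does not depend on sizeof(long) (4 on LLP64, 8 on LP64).
// Inputs above 2^63 have no representable answer and wrap around to 0.
inline uint64_t next_pow2_u64(uint64_t v)
{
    if (v <= 1)
        return 1;

    // Smear the highest set bit of (v - 1) into every lower bit position,
    // then add one to reach the next power of two.
    v--;
    v |= v >> 1;
    v |= v >> 2;
    v |= v >> 4;
    v |= v >> 8;
    v |= v >> 16;
    v |= v >> 32;   // this step is what a 32-bit `long` version would be missing
    return v + 1;
}
```

Using `uint64_t` rather than `unsigned long` keeps the behaviour identical across MinGW, MSVC, GCC and Clang targets.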

03:15 <_whitenotifier-3> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/ngscopeclient/scopehal-apps/compare/110a14231025...a82cccd3cff8

03:15 <_whitenotifier-c> [scopehal-apps] azonenberg a82cccd - Updated submodules

04:27 <_whitenotifier-c> [scopehal-apps] azonenberg closed issue #680: Filter expressions in protocol analyzer are not serialized to scopesessions - https://github.com/ngscopeclient/scopehal-apps/issues/680

04:27 <_whitenotifier-3> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±3] https://github.com/ngscopeclient/scopehal-apps/compare/a82cccd3cff8...eb32f9c5ec1f

04:27 <_whitenotifier-c> [scopehal-apps] azonenberg eb32f9c - ProtocolAnalyzerDialog: serialize current filter expression. Fixes #680.

05:22 <_whitenotifier-c> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/ngscopeclient/scopehal-apps/compare/eb32f9c5ec1f...40ac34dc956f

05:22 <_whitenotifier-3> [scopehal-apps] azonenberg 40ac34d - Allow protocol overlays and spectrograms to be stacked regardless of which one is being dragged

12:13 bvernoux has joined #scopehal

13:45 _whitelogger has joined #scopehal

14:23 bvernoux has quit [Quit: Leaving]

15:35 <d1b2> <johnsel> @azonenberg did you end up rebooting the server?

16:09 <d1b2> <azonenberg> since we had the pcie passthru issues? no i was busy on something else last night

16:09 <d1b2> <azonenberg> remind me later today and i can give it a try

16:09 <d1b2> <david.rysk> A reboot will probably fix it; if you want more in-depth troubleshooting set me up with access and let me know 😛

16:09 <d1b2> <azonenberg> Yes david's vpn access is also still pending

16:09 <d1b2> <azonenberg> i have the certificate open on my CA system and have been too busy with other stuff to sign it and send back :p

16:10 <d1b2> <david.rysk> also I have Windows CI working, and the CMake changes work on Windows pretty much verbatim. Have to fix up packaging and do some more cleanup

16:10 <d1b2> <johnsel> you'd get only permissions on the XOA system, not the xen. Unless azonenberg wants to open up the whole system for you

16:10 <d1b2> <david.rysk> I mean I can work with azonenberg to troubleshoot the PCIe stuff if he's not busy

16:10 <d1b2> <david.rysk> but he's always busy :p

16:10 <d1b2> <johnsel> but we agreed that we wouldn't do that between me & azonenberg

16:10 <d1b2> <johnsel> sure, me too

16:10 <d1b2> <johnsel> it's his lack of time that is the problem

16:11 <d1b2> <johnsel> that said, whoever fixes it is fine for me

16:11 <d1b2> <david.rysk> The hardware that I have wouldn't give us a real benefit over GH-hosted CI for the size of the project at the moment

16:11 <d1b2> <johnsel> if you're more familiar with the xen backend

16:11 <d1b2> <johnsel> I am not very, I have run a local instance of XCP-ng for a while to know what is where but otherwise I am not

16:12 <d1b2> <johnsel> Sure, and we want to set up actual hw in the near future

16:13 <d1b2> <johnsel> I said from the beginning he should have bought a separate system that he can just give me KVM access to but that was not feasible

16:13 <d1b2> <johnsel> it would have made things a whole lot easier

16:14 <d1b2> <johnsel> we can also go the Docker route if you want to collaborate on that @david.rysk

16:14 <d1b2> <david.rysk> We can, we just need a solid, working VM

16:14 <d1b2> <johnsel> I initially wanted a single system (terraform) to handle both windows and linux

16:14 <d1b2> <johnsel> and can eventually do the same for osx

16:14 <d1b2> <david.rysk> Docker should be pretty straightforward, but I'm not sure how that interacts with GH Actions, is there a guide?

16:14 <d1b2> <johnsel> there is an app

16:14 <d1b2> <johnsel> let me link it

16:14 <d1b2> <david.rysk> for macOS you'll need a Mac

16:15 <d1b2> <david.rysk> Terraform from Hashicorp? Note potential licensing concerns

16:15 <d1b2> <johnsel> https://github.com/actions/actions-runner-controller

16:15 <d1b2> <david.rysk> aah k8s

16:15 <d1b2> <johnsel> why?

16:15 <d1b2> <johnsel> the main app is open source

16:16 <d1b2> <johnsel> and we don't publish it so whatever license they use is fine

16:16 <d1b2> <david.rysk> no, it's shared-source under BSL

16:16 <d1b2> <david.rysk> but I guess we'd be ok, still

16:16 <d1b2> <johnsel> it's fine, nobody gets to interact with it

16:17 <d1b2> <david.rysk> @azonenberg does xoa support nested virtualization in your environment?

16:17 <d1b2> <johnsel> anyway yes their recommended autoscaler for docker would need a k8s backend

16:17 <d1b2> <johnsel> it does

16:17 <d1b2> <johnsel> but not with pcie passthrough

16:18 <d1b2> <johnsel> and azonenberg barely knows anything about the ci haha

16:18 <d1b2> <johnsel> I can give you access to the repo with the docs and scripts though

16:18 <d1b2> <johnsel> I do insist on an infra as code setup, i.e. we should be able to recreate everything running from scripts without any manual provisioning

16:19 <d1b2> <johnsel> but that's all dealt with in principle for the vms

16:19 <d1b2> <johnsel> just need to add the k8s deployment

16:21 <d1b2> <david.rysk> yeah but we need passthrough

16:21 <d1b2> <johnsel> yes so we can't use nested virtualization

16:21 <d1b2> <johnsel> but we don't need it either for docker

16:21 <d1b2> <david.rysk> true

16:22 <d1b2> <johnsel> I had wished we could do it with hyper-v

16:22 <d1b2> <david.rysk> that's probably the way I'd go then

16:22 <d1b2> <johnsel> that would have been great

16:22 <d1b2> <david.rysk> I'm more used to kvm (e.g. Proxmox)

16:22 <d1b2> <johnsel> yeah in theory xen can do kvm as well but we're fairly limited by the xoa interface

16:23 <d1b2> <johnsel> it's an abstraction that implements the auth layer so andrew can have his private and work vms separated properly from our ci stuff

16:23 <d1b2> <johnsel> that was a hard requirement

16:23 <d1b2> <david.rysk> yeah and Andrew already uses xen

16:23 <d1b2> <david.rysk> so using something else isn't an option

16:23 <d1b2> <johnsel> which is why we went xcp-ng+xoa

16:24 <d1b2> <johnsel> it was the closest to what he already had

16:24 <d1b2> <johnsel> we considered other platforms but the GPU passthrough benefits weren't clear and it would have been a lot of work to port

16:24 <d1b2> <johnsel> in retrospect now we're dealing with these esoteric issues it might have been the right choice but hindsight is 20/20

16:24 <d1b2> <david.rysk> I'd probably only consider Proxmox as an alternative here

16:25 <d1b2> <johnsel> yes esxi would have been ideal but I think we had some license limitations that made it not feasible

16:26 <d1b2> <johnsel> in my experience its vGPU support is fairly rock solid

16:26 <d1b2> <johnsel> but we had both linux and windows w/ GPUs working previously so hopefully with a reboot we can just deploy them once and keep them running

16:26 <d1b2> <johnsel> and accept that for now we can't have clean systems for every build

16:27 <d1b2> <johnsel> at least for windows

16:27 <d1b2> <johnsel> for linux things are a little simpler

16:27 <d1b2> <johnsel> I used to manage weird embedded docker stuff back when I was still working full time

16:28 <d1b2> <johnsel> basically a small datacenter on a NUC w/ full remote access and firmware update ability

16:28 <d1b2> <johnsel> a whole fleet of them

16:29 <d1b2> <johnsel> I was at some point working on a system that would work with a fixed pxe endpoint to boot from and then fully auto-provision itself on the fly

16:32 <_whitenotifier-c> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±2] https://github.com/ngscopeclient/scopehal/compare/c2b1c25359cc...495cad3d68a5

16:32 <_whitenotifier-3> [scopehal] azonenberg 495cad3 - Fixed regression from adding 0x prefix that broke SI scaling prefixes

16:32 <_whitenotifier-c> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/ngscopeclient/scopehal-apps/compare/40ac34dc956f...983d6ad5402f

16:32 <_whitenotifier-3> [scopehal-apps] azonenberg 983d6ad - Updated to latest scopehal

16:37 <d1b2> <johnsel> pm @david.rysk

17:31 <d1b2> <bvernoux> I confirm it fixes the crash issue with Demo Oscilloscope on Windows

17:32 <azonenberg> awesome

17:32 <d1b2> <david.rysk> awesome, I've been testing on Windows here

17:32 <d1b2> <bvernoux> I have uploaded my full build with bin stripped https://hydrabus.com/ngscopeclient/ngcopeclient_release_strip_Win64_bin_31Jan2024_13h24.7z

17:32 <d1b2> <david.rysk> oh my prelim install instructions for msys2 are just

17:32 <d1b2> <david.rysk> install these dependencies:

17:33 <d1b2> <david.rysk> mingw-w64-ucrt-x86_64-cmake mingw-w64-ucrt-x86_64-toolchain mingw-w64-ucrt-x86_64-libsigc++ mingw-w64-ucrt-x86_64-cairomm mingw-w64-ucrt-x86_64-gtkmm3 mingw-w64-ucrt-x86_64-yaml-cpp mingw-w64-ucrt-x86_64-glfw mingw-w64-ucrt-x86_64-catch mingw-w64-ucrt-x86_64-vulkan-headers mingw-w64-ucrt-x86_64-vulkan-loader mingw-w64-ucrt-x86_64-shaderc mingw-w64-ucrt-x86_64-glslang mingw-w64-ucrt-x86_64-spirv-tools mingw-w64-ucrt-x86_64-ffts

17:33 <d1b2> <david.rysk> then cmake .. / make

17:33 <d1b2> <david.rysk> the vulkan SDK being set up might mess it up, and this might need my CMake work, I'll do more testing in the near future

17:33 <d1b2> <david.rysk> but basically we can and should avoid the Vulkan SDK entirely when building with MinGW

17:34 <d1b2> <johnsel> I think we had a lot of issues with glslang(?) missing on windows

17:34 <d1b2> <david.rysk> since (1) MinGW includes all the needed packages and (2) C++ ABI is incompatible between MinGW and MSVC (which is what the Vulkan SDK is built with)

17:35 <d1b2> <david.rysk> I ran into C++ linkage problems when I tried using my CMake changes, which make it actually try to link to the SDK if the SDK is installed

17:35 <d1b2> <david.rysk> which is expected as MinGW does not use VC++ C++ libs at all

17:35 <d1b2> <david.rysk> the C++ ABI is just completely different

17:35 <d1b2> <johnsel> Yeah it's clunky, we ideally wanted to drop msys2 entirely and build w/ msvc instead

17:36 <d1b2> <david.rysk> for that I'd want to dump the requirements for gtkmm/cairomm

17:36 <d1b2> <johnsel> that gives a much more expected environment for windows devs as well

17:36 <d1b2> <david.rysk> since they complicate matters

17:36 <d1b2> <johnsel> yep that's on the list

17:36 <d1b2> <david.rysk> ffts too but ffts isn't a real problem (heh!)

17:36 <d1b2> <johnsel> yes those 3 are annoying, the first 2 being real blockers

17:36 <d1b2> <johnsel> we (that is azonenberg mostly) have been working those dependencies out of the codebase

17:37 <d1b2> <johnsel> with the goal to lose msys2 entirely eventually

17:37 <d1b2> <david.rysk> I started looking at using fftw; fftw has some annoying bugs in its packaging (and hasn't had a release in too long) but I should be able to work around that

17:37 <d1b2> <johnsel> I actually think we went from fftw to ffts

17:37 <d1b2> <johnsel> or it was only considered at some time

17:37 <d1b2> <david.rysk> azonenberg wants to use fftw for the tests

17:38 <d1b2> <david.rysk> fftw is GPL which makes it undesirable for the entire project

17:38 <d1b2> <johnsel> yeah I'm not sure why we want to keep that

17:38 <d1b2> <johnsel> I guess to double check the output

17:38 <d1b2> <david.rysk> otherwise I'd look at integrating RustFFT but that means writing some interfacing code (and needing a supported Rust compiler, but at this point all distros seem to have one)

17:38 <d1b2> <johnsel> I mean the obvious choice is to just use vkFFT imo

17:39 <d1b2> <david.rysk> yeah that's the idea, use vkFFT in the project, use something else for the tests

17:39 <d1b2> <johnsel> personally I'd use vkfft everywhere

17:39 <d1b2> <johnsel> they have enough tests on their end

17:39 <d1b2> <johnsel> but that's just what I'd do

17:40 <d1b2> <johnsel> I definitely wouldn't employ rust for it

17:40 <d1b2> <johnsel> We're trying to lean down, not bulk up the build process

17:40 <d1b2> <david.rysk> interestingly the build system stuff there is pretty mature/robust

17:40 <d1b2> <johnsel> sure but it's also not developer friendly

17:41 <d1b2> <johnsel> e.g. Mark would love to just have a project that he can open in visual studio to work on Windows support

17:41 <d1b2> <johnsel> having lots of dependencies to set up the build process makes the barrier for people to contribute much higher

17:42 <d1b2> <johnsel> e.g. I regularly want to do some work on a driver but then I have to go through the whole set up with msys2 etc and then I can't copy the commands from the docs and neither from the CI at this point

17:42 <d1b2> <johnsel> so then I need to find the sources of the docs, but the latex syntax makes those annoying to copy paste as well

17:43 <d1b2> <johnsel> by the time I have everything set up I've gone through so much effort I don't even want to code anymore

17:43 <d1b2> <david.rysk> yeah I have a WIP docs improvement that will come along with the CMake PR

17:43 <d1b2> <johnsel> it should be as simple as git pull and replicate the ci commands

17:43 <d1b2> <david.rysk> that's kinda the whole point of the CMake work I'm doing

17:43 <d1b2> <david.rysk> fix all this

17:44 <d1b2> <johnsel> I understand and I think it's great that you do. I just wanted to give my perspective on what I think is not ideal for the project

17:44 <d1b2> <david.rysk> see what it should be is: go here and install the package (for CMake, etc)

17:44 <d1b2> <david.rysk> then when you run CMake, it automatically finds everything that has been installed

17:44 <d1b2> <david.rysk> but that will have to come with the MSVC overhaul

17:44 <d1b2> <david.rysk> you will still need to run CMake, but at that point you can even run it from the GUI

17:45 <d1b2> <johnsel> yeah but cmake I think is expected and fine

17:45 <d1b2> <david.rysk> I'm also going to look at using vcpkg to handle pulling dependencies

17:45 <d1b2> <david.rysk> (for MSVC)

17:45 <d1b2> <johnsel> I think that's a good idea, I wanted to do the same

17:45 <d1b2> <johnsel> I made a start on it

17:45 <d1b2> <johnsel> I don't know if I have the files still

17:45 <d1b2> <johnsel> I'll check

17:45 <d1b2> <david.rysk> It doesn't make sense until we have the code working on MSVC with manual deps

17:46 <d1b2> <johnsel> but yes that's a good choice I think then Windows people can use their expected Windows tools

17:46 <d1b2> <david.rysk> anyway

17:46 <d1b2> <david.rysk> did you look at my MSYS CI config?

17:46 <d1b2> <johnsel> No I went through that to figure out where the build fails on

17:46 <d1b2> <johnsel> I did not yet

17:46 <d1b2> <johnsel> Do you have a link?

17:46 <d1b2> <david.rysk> https://github.com/d235j/scopehal-apps/blob/master/.github/workflows/build-windows.yml

17:46 <d1b2> <david.rysk> it's much simpler than the Linux ones

17:46 <d1b2> <david.rysk> oh wait

17:46 <d1b2> <david.rysk> that's the wrong branch

17:47 <d1b2> <david.rysk> https://github.com/d235j/scopehal-apps/blob/ci-work/.github/workflows/build-windows.yml

17:47 <d1b2> <david.rysk> the simplified MSYS instructions are literally: install MSYS2, install list of packages, cmake .., make

17:47 <d1b2> <johnsel> I like that, needs vulkan still

17:47 <d1b2> <david.rysk> it's pulling vulkan from the MSYS2 repos

17:47 <d1b2> <johnsel> Please make it copy-pasteable though

17:48 <d1b2> <johnsel> oh great

17:48 <d1b2> <johnsel> then I love it

17:48 <d1b2> <david.rysk> I'm not sure I follow how these are not copy-pasteable?

17:48 <d1b2> <david.rysk> oh I'm not even using env.VULKAN_SDK_VERSION in this one

17:48 <d1b2> <johnsel> I thought it needed Vulkan still

17:48 <d1b2> <johnsel> which I thought you'd implement with that annoying template var

17:48 <d1b2> <johnsel> I really hate those with passion.

17:48 <d1b2> <david.rysk> I'm not sure I follow why

17:49 <d1b2> <johnsel> But this is great yes

17:49 <d1b2> <johnsel> Well the CI system produces essentially the golden copy of the application on a certain platform

17:49 <d1b2> <johnsel> Ideally that means that a developer on that platform can just copy the commands from the CI script locally

17:50 <d1b2> <johnsel> I think it's a mentality difference between software people and EEs who are used to looking at documentation for what to do

17:50 <d1b2> <david.rysk> imo the CI scripts should not be the documentation

17:50 <d1b2> <johnsel> For cloud stuff it's expected that you can just run the CI commands locally and get the same result

17:50 <d1b2> <david.rysk> and it's our job to update the documentation to match the CI scripts

17:51 <d1b2> <johnsel> Like I said, I think that's a mentality difference. Where I come from the CI builds are the golden copy and you want to be able to replicate them. Without needing to refer to documentation

17:51 <d1b2> <david.rysk> there are too many places where you need to use these env vars 😦

17:51 <d1b2> <johnsel> So in essence they are documentation for the build process

17:52 <d1b2> <johnsel> I'll see if you can export them once or so and at least minimize the need to modify a script

17:52 <d1b2> <david.rysk> I guess you can do that in some places

17:52 <d1b2> <johnsel> Dinner is ready though so it will have to wait until after

17:52 <d1b2> <david.rysk> but like, for msys to properly do matrix build for all the configs, I need to make the list of packages have

17:52 <d1b2> <david.rysk> e.g. mingw-w64-${{ matrix.env }}-x86_64-toolchain

17:53 <d1b2> <david.rysk> or I can use pacboy

17:53 <d1b2> <johnsel> I mean you don't have to

17:53 <d1b2> <david.rysk> but then the user will need to know to install pacboy

17:54 <d1b2> <johnsel> pacman?

17:54 <d1b2> <david.rysk> no

17:54 <d1b2> <david.rysk> https://www.msys2.org/docs/package-naming/#avoiding-writing-long-package-names

17:54 <d1b2> <david.rysk> it handles the prefix

17:54 <d1b2> <johnsel> Well I'd love to discuss further after my dinner. But imo verbosity in these scripts is only desirable

17:54 <d1b2> <david.rysk> sure

17:57 <d1b2> <david.rysk> we probably want to support windows-on-ARM too...

17:57 <d1b2> <david.rysk> which means we'd need a device for that 😛

18:04 <d1b2> <david.rysk> also regarding Rust, there's this CMake tool called Corrosion that handles the entire "build lib into .dll/.so/.dylib" part for you

18:04 <d1b2> <david.rysk> but yeah that's lower priority

18:05 <d1b2> <azonenberg> We have had bugs in the past in which incorrect usage of vkFFT (e.g. padding settings) caused garbage output

18:05 <d1b2> <azonenberg> I want a golden FFT implementation in the unit tests to catch such bugs

18:05 <d1b2> <david.rysk> seems the go-to device is the $699 Windows Dev Kit 2023, but it's SLOW

18:05 <d1b2> <azonenberg> and that implementation being GPL is a non-issue

18:05 <d1b2> <azonenberg> Hence looking at FFTW

18:05 <d1b2> <david.rysk> a VM on a Mac will be significantly better performing

18:05 <d1b2> <azonenberg> vkFFT for all actual library/application code is a given
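
The "golden FFT in the unit tests" idea above can be sketched as follows. This is only an illustration under stated assumptions: it uses a naive O(n^2) DFT as the reference instead of FFTW, and the names `reference_dft` and `max_abs_error` are hypothetical, not taken from the scopehal test suite. A test would run the implementation under test (for example a vkFFT-backed filter) on the same input and require the error against the reference to stay below a tolerance.

```cpp
#include <cassert>
#include <cmath>
#include <complex>
#include <cstddef>
#include <limits>
#include <vector>

using cplx = std::complex<double>;

// Hypothetical reference ("golden") transform: a direct O(n^2) evaluation of
// X[k] = sum_t x[t] * exp(-2*pi*i*k*t/n). Slow, but simple enough to trust.
inline std::vector<cplx> reference_dft(const std::vector<cplx>& in)
{
    const std::size_t n = in.size();
    const double pi = std::acos(-1.0);
    std::vector<cplx> out(n);
    for (std::size_t k = 0; k < n; k++)
    {
        cplx sum(0.0, 0.0);
        for (std::size_t t = 0; t < n; t++)
        {
            // Reduce k*t modulo n so the angle stays small and accurate
            const double angle = -2.0 * pi * double((k * t) % n) / double(n);
            sum += in[t] * cplx(std::cos(angle), std::sin(angle));
        }
        out[k] = sum;
    }
    return out;
}

// Largest per-bin magnitude difference between two spectra.
// A length mismatch (e.g. from a wrong padding setting) returns infinity
// so that it can never pass a tolerance check.
inline double max_abs_error(const std::vector<cplx>& a, const std::vector<cplx>& b)
{
    if (a.size() != b.size())
        return std::numeric_limits<double>::infinity();
    double worst = 0.0;
    for (std::size_t i = 0; i < a.size(); i++)
        worst = std::fmax(worst, std::abs(a[i] - b[i]));
    return worst;
}
```

Because the reference lives only in the test binary, its license and speed do not affect the shipped library, which matches the split described above.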

20:40 <d1b2> <azonenberg> @johnsel just got a day-old debian ci build failure

20:40 <d1b2> <david.rysk> yeah I've been getting the same, I have abandoned the selfhosted CI for now, until we can get it more stable

20:40 <d1b2> <david.rysk> IMO we should just have one VM and use lightweight containers (docker) inside

20:40 <d1b2> <david.rysk> though then we have to see about GPU passthru

20:41 <d1b2> <david.rysk> docker claims to have some sort of GPU passthru support

20:50 <_whitenotifier-3> [scopehal-apps] azonenberg 2860cae - Scrolling cleanup

20:50 <_whitenotifier-c> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/ngscopeclient/scopehal-apps/compare/983d6ad5402f...2860cae94137

20:52 <d1b2> <david.rysk> btw if someone wants to download these builds that I'm doing on the CI, https://github.com/d235j/scopehal-apps/actions/runs/7731824532