#scopehal on 2022-10-04 — irc logs at libera.irclog.whitequark.org

2022-03-25 21:41 azonenberg changed the topic of #scopehal to: libscopehal, libscopeprotocols, and glscopeclient development and testing | https://github.com/glscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal

03:47 Degi_ has joined #scopehal

03:48 Degi has quit [Ping timeout: 265 seconds]

03:48 Degi_ is now known as Degi

03:59 <_whitenotifier-7> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/glscopeclient/scopehal/compare/b4cf3783ddae...62825f3b3581

03:59 <_whitenotifier-7> [scopehal] azonenberg 62825f3 - Disabled GPU acceleration in TRCImportFilter since it seems to be broken. Will investigate after queue refactoring is done since a proper fix depends on a queue allocator.

04:02 <azonenberg> numerical precision issues in waveform rendering are back. unsure if refactoring bug or something else

04:02 <azonenberg> (in glscopeclient)

04:02 <azonenberg> investigating...

04:19 <azonenberg> Seems like it was introduced with the vulkan refactoring. that's a massive enough change it will be hard to bisect etc. Time to have some fun lol

04:27 <azonenberg> lain: so i figured out the plotRight issue while looking at this

04:27 <azonenberg> turns out you were doing the right thing

04:28 <lain> :O

04:28 <azonenberg> the texture is drawn over the entire gl viewport with no transformation

04:28 <azonenberg> essentially what we end up doing is, allocate the buffer the size of the whole window

04:28 <azonenberg> but only actually invoke the shader on the first plotRight columns

04:28 <azonenberg> so we just waste a little memory

04:28 <lain> ohh

04:29 <azonenberg> We could fix it, but ngscopeclient eliminates the issue entirely so not worth it

04:29 <azonenberg> right now i'm investigating what looks like weird numerical precision issues the GL shader doesnt have though

04:29 <lain> oo

04:30 <d1b2> <azonenberg> https://cdn.discordapp.com/attachments/776941750291267595/1026712786895642714/precision.png

04:30 <d1b2> <azonenberg> https://cdn.discordapp.com/attachments/776941750291267595/1026712814229925899/precision2.png

04:33 <lain> innteresting

04:36 <azonenberg> this is a significant regression so probably worth you spending time troubleshooting if you can stash your WIP mac stuff?

04:36 <azonenberg> i dumped a test case in ark:/tmp/

04:36 <azonenberg> foo.scopesession plus the two .trc files

04:36 <lain> kk

04:37 <lain> I'll look into that tomorrow

04:37 <azonenberg> it should open to a view that shows off the failure mode

04:37 <azonenberg> ok

04:37 <azonenberg> (This is bad enough i'm not going to be demoing it like this on my trip. Worst case i can check out a pre-vulkan commit for that purpose)

04:38 <azonenberg> i just noticed it while going through my library of scopesessions looking for cool things to show

04:46 <lain> yeah this'll be fun to bisect lol

04:49 <azonenberg> (can you grab the test case quick and confirm it opens / exhibits the failure condition for you?)

04:50 <azonenberg> i patched the scopesession to use a relative path for the import files so you'll need to have pwd be whatever you dumped the files into

04:56 <lain> ah, 1sec

04:59 <azonenberg> aaand now qsgmii is segfaulting for unrelated reasons, yaaay

05:01 <lain> we need more tests :P

05:02 <azonenberg> we do :p

05:02 <azonenberg> i have a large corpus of test data but no automated testing attached to it

05:04 <lain> well

05:05 <lain> in glscopeclient on macos on my m1 it hits an unhandled exception

05:05 <lain> so that's fun

05:07 <azonenberg> what's the exception?

05:07 <azonenberg> it's probably failing to find some file because you have it in the wrong directory?

05:07 <azonenberg> like i said i patched the test case to be a relative path

05:08 <lain> nope

05:08 <lain> it's vulkan-related

05:08 <azonenberg> :o

05:08 <lain> also the foo.scopesession was using an absolute path on your /ceph/ but I edited it to use an absolute path to where I dumped the files locally

05:08 <azonenberg> ah i swear i patched it

05:08 <azonenberg> must have been after i uploaded it lol

05:09 <lain> oh, fascinating

05:09 <lain> if I fire up vkconfig, it didn't crash this time

05:09 <lain> plenty of messages to help me narrow down what broke

05:09 <lain> azonenberg: actually my first time loading a scopesession file, should the waveforms show up immediately or do I have to hit a button?

05:10 <azonenberg> it should show up immediately

05:10 <azonenberg> oh wait

05:10 <azonenberg> you might have hit a bug i just patched

05:10 <azonenberg> grab 62825f3 in scopehal

05:11 <azonenberg> (latest)

05:11 <lain> lol

05:11 <lain> does that include my macos patches?

05:11 <azonenberg> that's scopehal side

05:11 <lain> or I guess you mean merge those huh

05:11 <azonenberg> you can cherrypick that commit

05:11 <azonenberg> it shouldnt conflict

05:12 <azonenberg> so it seems an AcceleratorBuffer containing a std::vector is a bad idea

05:12 <azonenberg> i need to troubleshoot more

05:12 <lain> oh?

05:13 <azonenberg> AcceleratorBuffer generally expects to work with POD datatypes

05:13 <azonenberg> i'm not sure exactly how it's going sideways yet

05:13 <azonenberg> but it probably won't handle anything containing pointers to itself well

05:13 <azonenberg> for starters
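A minimal sketch of why a buffer that gets byte-copied to a device wants plain data types. The helper names below (RawCopyForDevice, CanLiveInDeviceBuffer) are hypothetical and are not the real AcceleratorBuffer API; they only illustrate the trivially-copyable constraint being discussed.

```cpp
#include <cstddef>
#include <cstring>
#include <type_traits>
#include <vector>

// Hypothetical helper, not part of libscopehal.
// Copies elements byte-for-byte, the way a transfer into a staging buffer would.
// That is only valid for trivially copyable types: an object such as std::vector
// holds a pointer to heap storage, and a raw byte copy would duplicate the pointer
// rather than the data it refers to.
template <typename T>
std::vector<unsigned char> RawCopyForDevice(const std::vector<T>& src)
{
    static_assert(std::is_trivially_copyable<T>::value,
        "raw byte copies are only valid for trivially copyable types");

    std::vector<unsigned char> bytes(src.size() * sizeof(T));
    if(!src.empty())
        std::memcpy(bytes.data(), src.data(), bytes.size());
    return bytes;
}

// Hypothetical compile-time query a container could use to decide whether
// a type may be placed in device-visible memory or must stay CPU-only.
template <typename T>
constexpr bool CanLiveInDeviceBuffer()
{
    return std::is_trivially_copyable<T>::value;
}
```

With this sketch, CanLiveInDeviceBuffer<float>() is true, while CanLiveInDeviceBuffer<std::vector<int>>() is false, so an element type containing a std::vector would have to take a CPU-only path.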

05:13 <lain> ok building glscopeclient with the changes

05:13 <lain> I just did a git merge origin/master because I was already close enough that it was seamless

05:14 <lain> HEY

05:14 <lain> I have waveofrms

05:14 <lain> waveforms*

05:14 <lain> unrelated, the sleepy medicine is kicking in so this is gonna go sideways real fast

05:14 <lain> but yes this appears to exhibit the bug, screenshot inbound to confirm

05:15 <_whitenotifier-7> [scopehal] azonenberg labeled issue #704: Regression: all Ethernet protocol decodes crash - https://github.com/glscopeclient/scopehal/issues/704

05:15 <_whitenotifier-7> [scopehal] azonenberg opened issue #704: Regression: all Ethernet protocol decodes crash - https://github.com/glscopeclient/scopehal/issues/704

05:15 <lain> https://dt.lain.land/Screen%20Shot%202022-10-03%20at%2023.14.42.png

05:15 <lain> pls confirm but I assume C2-C1 shouldn't look like that

05:15 <lain> that would be SubtractFilter, yeah?

05:16 <lain> anyway I'll bisect this on my x86_64 box tomorrow so I don't have to deal with breaking macos compat during the bisect

05:16 <azonenberg> so that is actually a TIE filter

05:17 <azonenberg> looking at clock jitter of C2-C1 wrt the PLL running on them

05:17 <lain> ohh

05:17 <azonenberg> it's a sparse waveform

05:17 <azonenberg> with femtosecond timescale

05:17 <lain> wonderful

05:17 <azonenberg> which means that all of the timestamp values are enormous

05:17 <azonenberg> this used to work, now it doesn't

05:17 <azonenberg> if i had to guess, something is being cast to fp32 that used to be done in the int64 domain

05:18 <lain> probably a bug in the shader or invoking it is my guess, but yeah, bisect will tell all

05:18 <azonenberg> we need to be extremely careful with type promotions when working with timestamps

05:18 <lain> aye

05:18 <azonenberg> keep as much as possible in the int64 domain and convert to float for rendering at the last possible moment
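A rough sketch of the "stay in int64, convert late" rule, assuming femtosecond timestamps. The function and parameter names (TimestampToPixel, viewOffsetFs, pixelsPerFs) are made up for illustration and do not come from the glscopeclient source.

```cpp
#include <cstdint>

// Hypothetical names throughout:
//   timestampFs  - absolute sample time in femtoseconds
//   viewOffsetFs - time at the left edge of the plot, in femtoseconds
//   pixelsPerFs  - horizontal zoom factor

// Preferred order: subtract in the int64 domain (exact), then convert.
// The float only has to represent the span of the visible plot.
float TimestampToPixel(int64_t timestampFs, int64_t viewOffsetFs, float pixelsPerFs)
{
    int64_t relativeFs = timestampFs - viewOffsetFs;
    return static_cast<float>(relativeFs) * pixelsPerFs;
}

// Risky order: both operands are rounded to fp32 before the subtraction,
// so small differences between two huge timestamps can vanish entirely.
float TimestampToPixelEarlyCast(int64_t timestampFs, int64_t viewOffsetFs, float pixelsPerFs)
{
    float t = static_cast<float>(timestampFs);
    float off = static_cast<float>(viewOffsetFs);
    return (t - off) * pixelsPerFs;
}
```

For example, with timestampFs = 1000000000123, viewOffsetFs = 1000000000000 and pixelsPerFs = 1.0f, the first function gives 123 while the second gives 0 on an IEEE 754 platform, because both timestamps round to the same fp32 value.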

05:18 <lain> it's also plausible something behaves differently in vulkan

05:18 <lain> shaders

05:18 <azonenberg> That is also plausible

05:18 <lain> anyway, I'll get the nvidia debug stuff installed on my amd box and get this sorted :3

05:18 <azonenberg> anyway, i dont care if the bug is in your changes or vulkan's handling of math or what

05:18 <azonenberg> but yeah

05:19 <lain> but for now, the sleepy meds have kicked in so I'm lucky to be typing words :P

05:19 <lain> nini

06:44 massi has joined #scopehal

08:06 tiltmesenpai has quit [Read error: Connection reset by peer]

08:07 tiltmesenpai has joined #scopehal

11:09 <_whitenotifier-7> [scopehal-apps] RX14 commented on issue #507: Link error with reflowmon - https://github.com/glscopeclient/scopehal-apps/issues/507#issuecomment-1266784220

14:19 massi has quit [Remote host closed the connection]

14:20 massi has joined #scopehal

15:19 bvernoux has joined #scopehal

15:31 <_whitenotifier-7> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±2] https://github.com/glscopeclient/scopehal/compare/62825f3b3581...42791c660448

15:31 <_whitenotifier-7> [scopehal] azonenberg 42791c6 - AcceleratorBuffer: allow non trivially copyable types, but only in CPU-only buffers

15:36 * lain wakes up

15:48 <_whitenotifier-7> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/glscopeclient/scopehal/compare/42791c660448...3aeb074fe095

15:48 <_whitenotifier-7> [scopehal] azonenberg 3aeb074 - VectorFrequencyFilter: fixed missing resize call

15:55 <_whitenotifier-7> [scopehal] azonenberg closed issue #704: Regression: all Ethernet protocol decodes crash - https://github.com/glscopeclient/scopehal/issues/704

15:55 <_whitenotifier-7> [scopehal] azonenberg commented on issue #704: Regression: all Ethernet protocol decodes crash - https://github.com/glscopeclient/scopehal/issues/704#issuecomment-1267221287

16:07 <lain> hm ok

16:07 <lain> well

16:07 <lain> my x86_64 box isn't rendering *anything* when I open that scopesession from master, so that's neat :D

16:09 <azonenberg> lol lovely

16:09 <azonenberg> let me check what happens for me

16:11 <_whitenotifier-7> [scopehal-apps] azonenberg pushed 4 commits to master [+2/-0/±32] https://github.com/glscopeclient/scopehal-apps/compare/1212cb5bced6...d8627a0b1443

16:11 <_whitenotifier-7> [scopehal-apps] azonenberg 6199908 - Initial implementation of preferences dialog in ngscopeclient. Supports boolean and enum preferences only. See #522.

16:11 <_whitenotifier-7> [scopehal-apps] azonenberg e085a48 - Lots of work on preferences system in ngscopeclient

16:11 <_whitenotifier-7> [scopehal-apps] azonenberg 14e2e26 - Removed comment about unnecessary fix

16:11 <_whitenotifier-7> [scopehal-apps] azonenberg d8627a0 - Updated submodules. Fixed alpha calculation in ngscopeclient.

16:11 <azonenberg> lain: try with d8627a0 (latest master)

16:11 <azonenberg> it renders for me

16:11 <azonenberg> i fixed a few things in AcceleratorBuffer for non-POD datatypes recently

16:19 <azonenberg> (also i fixed a bug in the trc import filter recently that you might not have merged yet)

16:21 mikolajw has quit [*.net *.split]

16:21 whitequark has quit [*.net *.split]

16:21 asy_ has quit [*.net *.split]

16:21 jevinskie[m] has quit [*.net *.split]

16:21 lain has quit [*.net *.split]

16:21 sajattack[m] has quit [*.net *.split]

16:21 miek has quit [*.net *.split]

16:21 JSharp has quit [*.net *.split]

16:21 lethalbit has quit [*.net *.split]

16:21 t4nk_fn has quit [*.net *.split]

16:21 gruetzkopf has quit [*.net *.split]

16:21 ericonr has quit [*.net *.split]

16:21 sorear has quit [*.net *.split]

16:21 agg has quit [*.net *.split]

16:21 josuah has quit [*.net *.split]

16:21 electronic_eel has quit [*.net *.split]

16:21 bvernoux has quit [*.net *.split]

16:21 benishor has quit [*.net *.split]

16:21 massi has quit [*.net *.split]

16:21 tiltmesenpai has quit [*.net *.split]

16:21 bgamari has quit [*.net *.split]

16:21 Yamakaja has quit [*.net *.split]

16:21 monochroma has quit [*.net *.split]

16:21 Fridtjof has quit [*.net *.split]

16:21 esden has quit [*.net *.split]

16:21 tnt has quit [*.net *.split]

16:21 anuejn has quit [*.net *.split]

16:21 vup has quit [*.net *.split]

16:21 _florent_ has quit [*.net *.split]

16:21 welterde has quit [*.net *.split]

16:21 elms has quit [*.net *.split]

16:21 kbeckmann has quit [*.net *.split]

16:21 florolf has quit [*.net *.split]

16:21 mxshift has quit [*.net *.split]

16:21 Stary has quit [*.net *.split]

16:21 balrog has quit [*.net *.split]

16:21 d1b2 has quit [*.net *.split]

16:21 Stephie has quit [*.net *.split]

16:22 kbeckmann has joined #scopehal

16:22 esden has joined #scopehal

16:22 Yamakaja has joined #scopehal

16:22 bvernoux has joined #scopehal

16:22 massi has joined #scopehal

16:22 tiltmesenpai has joined #scopehal

16:22 lain has joined #scopehal

16:22 balrog has joined #scopehal

16:22 jevinskie[m] has joined #scopehal

16:22 whitequark has joined #scopehal

16:22 mikolajw has joined #scopehal

16:22 sajattack[m] has joined #scopehal

16:22 agg has joined #scopehal

16:22 asy_ has joined #scopehal

16:22 josuah has joined #scopehal

16:22 electronic_eel has joined #scopehal

16:22 t4nk_fn has joined #scopehal

16:22 bgamari has joined #scopehal

16:22 lethalbit has joined #scopehal

16:22 miek has joined #scopehal

16:22 JSharp has joined #scopehal

16:22 _florent_ has joined #scopehal

16:22 vup has joined #scopehal

16:22 tnt has joined #scopehal

16:22 welterde has joined #scopehal

16:22 florolf has joined #scopehal

16:22 elms has joined #scopehal

16:22 Stary has joined #scopehal

16:22 Stephie has joined #scopehal

16:22 d1b2 has joined #scopehal

16:22 anuejn has joined #scopehal

16:22 sorear has joined #scopehal

16:22 ericonr has joined #scopehal

16:22 gruetzkopf has joined #scopehal

16:22 monochroma has joined #scopehal

16:22 benishor has joined #scopehal

16:22 Fridtjof has joined #scopehal

16:22 mxshift has joined #scopehal

16:23 <lain> azonenberg: wb

16:25 <azonenberg> yay netsplits

16:25 <azonenberg> anyway did the latest master from a few minutes ago render for you?

16:26 <lain> lemme test

16:27 <lain> buildy buildy

16:27 <lain> buildy buildy :3

16:28 <lain> yep fixed

16:28 <lain> what'd you change? or should I just read the commit logs :P

16:30 <azonenberg> among other things i temporarily disabled the vulkan implementation of the int8/int16 -> fp32 conversion filter

16:30 <azonenberg> which is used by the .trc importer

16:30 <azonenberg> i dont know why it stopped working, it used to work and i didnt change anything

16:31 <azonenberg> but it didn't actually save us that much run time

16:31 <lain> oh I thought you already did that last night

16:31 <azonenberg> I did

16:31 <azonenberg> but i hadn't pushed the new submodule pointer to scopehal-apps

16:31 <lain> ahhhh ok

16:31 <azonenberg> because i had incomplete stuff in my working copy and didnt want to have to cherrypick part of it to push

16:31 <lain> gotcha

16:36 <_whitenotifier-7> [scopehal-apps] azonenberg opened issue #526: Figure out how to handle multiple defaults for colors depending on GUI theme - https://github.com/glscopeclient/scopehal-apps/issues/526

16:36 <_whitenotifier-7> [scopehal-apps] azonenberg labeled issue #526: Figure out how to handle multiple defaults for colors depending on GUI theme - https://github.com/glscopeclient/scopehal-apps/issues/526

16:50 massi has quit [Remote host closed the connection]

16:51 <lain> azonenberg: so I'm actually having trouble finding a previous commit that works as intended

16:54 <azonenberg> lain: try a10fe6c

16:55 <lain> doesn't work

16:56 <lain> I'll rebuild to see why it didn't work, but that was the first one I tried

16:57 <azonenberg> huh

16:57 <azonenberg> i swear i tried that last night and it was ok

16:58 <lain> will know in a few minutes!

16:58 <azonenberg> anyway, i just want a fix. i'd suggest focusing more on root cause than diagnosing a potential regression

16:58 <azonenberg> treat it as a new bug

16:58 <lain> hm ok

16:58 <lain> I was hoping to use bisect to narrow down the introduction point

16:58 <azonenberg> yeah i know

17:00 <lain> Warning: ReadDataFile: Could not open file "shaders/Convert16BitSamples.spv"

17:00 <lain> terminate called after throwing an instance of 'vk::UnknownError'

17:00 <lain> what(): vkCreateShaderModule: ErrorUnknown

17:00 <lain> fish: “./src/glscopeclient/glscopeclie…” terminated by signal SIGABRT (Abort)

17:00 <azonenberg> lol, ok

17:00 <azonenberg> so that's the import filter not being happy because of missing shader config

17:01 <azonenberg> you can probably just comment out the gpu code in TRCImportFilter as a quick workaround?

17:03 <lain> mmm it just segfaults

17:04 <azonenberg> or also try updating scopehal to latest

17:04 <azonenberg> and only using the old scopehal-apps code

17:04 <lain> oh, I patched it badly

17:04 <azonenberg> ah ok

17:04 <azonenberg> that would do it

17:09 <lain> eh, ok, this has bugs from the vulkan render shaders in it

17:09 <lain> I'mmm just gonna examine this as a new bug as you suggested :P

17:09 <azonenberg> yeah

17:10 <azonenberg> i feel like we've made extensive enough changes trying to bisect would be a nightmare and the diff would essentially be the entire shader rewrite

17:10 <lain> yeahhh

17:11 <azonenberg> as a starting point, i would be very suspicious of all math on x axis values especially if there is any chance of being cast to float32 at some point in the process

17:11 <azonenberg> i attempted to keep stuff in the int64 domain as long as possible such that after the offset, the dynamic range from left to right of a single plot would easily fit in fp32

17:11 <azonenberg> but that may have gone wrong somewhere

17:12 <azonenberg> as a first order experiment, i suggest patching some code at various points to act as if the samples were uniform

17:13 <azonenberg> essentially invent a fictional timebase and ignore the actual offset values in the code

17:13 <azonenberg> see what happens

17:13 <azonenberg> but i'm not sure if that will tell you anything useful

17:13 * lain nod

17:13 <azonenberg> it will at least let you confirm the problem is wrt x axis math i guess

17:14 <lain> true, yeah

17:19 <lain> azonenberg: just to confirm, this is how it SHOULD look, right? https://cdn.discordapp.com/attachments/776941750291267595/1026712786895642714/precision.png

17:19 <lain> or is that also exhibiting the bug

17:20 <azonenberg> That giant gap at left should not be there

17:20 <lain> ah ok

17:20 <azonenberg> you'll notice as you zoom in it comes and goes

17:20 <azonenberg> and some samples appear and disappear

17:21 <azonenberg> i think due to it miscalculating the left/right bounds of which samples go in which pixels

17:21 <lain> yep

17:21 <azonenberg> so some columns of pixels get no samples or something

17:21 <azonenberg> It would not surprise me if at least some of the bug is in PrepareGeometry() rather than the actual shader

17:21 <azonenberg> We had a similar bug some time ago and iirc converting "xscale" from fp32 to fp64 fixed it

17:21 <azonenberg> but it's fp64 now and we still have the issue

17:22 <azonenberg> but that may not have been a complete fix, idk

17:25 <lain> hmm

17:26 <lain> so I added some quick debug code to WaveformArea::RenderTrace, to just LogDebug whether data->IsDensePacked() is true or false

17:27 <lain> should there be multiple waveforms in the C2-C1 WaveformArea?

17:29 <lain> oh nvm I misread the output

17:30 <azonenberg> The top plot is the uniform analog differential input signal (computed by subtracting two signals from the trc import filter) with three digital overlays

17:31 <azonenberg> there's a threshold which is uniform as well i think

17:31 <azonenberg> the CDR PLL which is sparse

17:31 <azonenberg> and the PRBS check which is sparse and should be all zeroes (no PRBS errors)

17:31 octorian has quit [Ping timeout: 244 seconds]

17:31 <azonenberg> the second plot is the extracted cycle to cycle jitter and should be one sample per UI, sparse analog

17:31 octorian_ has joined #scopehal

17:35 octorian_ is now known as octorian

17:37 <lain> hmm

17:37 <lain> I'm suspicious of the zero hold stuff

17:37 <lain> still debugging

17:40 <azonenberg> yes, i was too

17:40 <azonenberg> it seemed to work ok when i tested prior to merging

17:40 <azonenberg> but i did not, in retrospect, test with any extremely long 1fs resolution sparse waveforms

17:40 <azonenberg> Which is where numerical precision errors would creep in

17:42 <lain> ah, I found a typo that would cause unpredictable behavior

17:42 <azonenberg> ooh

17:42 <_whitenotifier-7> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±4] https://github.com/glscopeclient/scopehal-apps/compare/d8627a0b1443...bf3c3aaf731c

17:42 <_whitenotifier-7> [scopehal-apps] azonenberg bf3c3aa - Added support for preferences of "color" type in ngscopeclient. Added prefs for grid configuration. Improved display of grid to avoid colliding labels. See #522.

17:42 massi has joined #scopehal

17:42 <azonenberg> from your refactoring or the zhold patches?

17:42 <lain> not *the* bug, but a bug

17:43 <lain> zhold patches

17:44 <lain> https://github.com/glscopeclient/scopehal-apps/blob/master/src/glscopeclient/WaveformArea_rendering.cpp#L165

17:44 <lain> looks like this should be sdigdat->m_durations

17:44 <azonenberg> yes, it does indeed look like it

17:44 <lain> I guess that wouldn't affect M1 though because unified memory :P

17:45 <azonenberg> no it does

17:45 <azonenberg> we are not using it as unified memory

17:45 <azonenberg> we have two different buffers we copy between

17:45 <azonenberg> they happen to be in the same address space

17:45 <azonenberg> but it's still two incoherent blocks of memory

17:46 <azonenberg> that need explicit syncing

17:46 <azonenberg> once we optimize to use a single buffer that's a different story

17:46 <_whitenotifier-7> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/glscopeclient/scopehal-apps/compare/bf3c3aaf731c...4250efe5e717

17:46 <_whitenotifier-7> [scopehal-apps] azonenberg 4250efe - Fixed copy-paste bug with duration values

17:46 <azonenberg> anyway just pushed a fix for that

17:46 <azonenberg> so i just realized something that further points to rounding or numerical precision issues

17:47 <azonenberg> Open the test case, click on the timeline, and drag it left/right veeery slowly

17:47 <azonenberg> note what looks like aliasing behavior

17:47 <azonenberg> pixels come and go as the waveform moves

17:47 <azonenberg> and the areas we render seem to shift as the x offset shifts

17:48 <azonenberg> you can also see that the waveforms have an area at the left that always renders correctly, then things go haywire

17:48 <azonenberg> aaand guess what

17:49 <lain> yep

17:49 <lain> :o

17:49 <lain> did you find it lol

17:49 <azonenberg> no

17:49 <azonenberg> but i have more evidence

17:50 <azonenberg> the first problems seem to crop up just about 2.5 ns

17:51 <azonenberg> hmm, nvm. i was off by a few OOMs

17:51 <azonenberg> i was thinking 2^31 fs

17:51 <azonenberg> but that's 2147 ns

17:52 <azonenberg> But, 2^23 fs is only 8 ns

17:52 <azonenberg> (23 bit mantissa in ieee754 fp32)

17:53 <azonenberg> which is the right order of magnitude for where we see problems

17:54 <azonenberg> if we do a bunch of multiplies and divides i could totally see rounding errors becoming >1 pixel at that scale
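The mantissa arithmetic above can be checked with a small sketch. fp32 stores 23 explicit fraction bits plus an implicit leading bit, so the gap between neighbouring values reaches 1 at 2^23 (8388608 fs, about 8.4 ns) and odd integers stop being representable past 2^24. The helper names are hypothetical.

```cpp
#include <cmath>
#include <cstdint>
#include <limits>

// Gap between a float and the next representable float above it
float SpacingAbove(float x)
{
    return std::nextafter(x, std::numeric_limits<float>::infinity()) - x;
}

// True if an integer tick count survives a round trip through fp32 unchanged
bool SurvivesFloat(int64_t ticks)
{
    return static_cast<int64_t>(static_cast<float>(ticks)) == ticks;
}
```

SpacingAbove(8388608.0f) is 1, SpacingAbove(16777216.0f) is 2, and SurvivesFloat(16777217) is false. A few multiplies and divides on values of that size can therefore accumulate errors of several femtoseconds each, which is consistent with (but does not prove) the guess about errors exceeding one pixel.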

18:14 <lain> hmm

18:14 <lain> azonenberg: likely unrelated, but I'm looking at the localSize calculation at the start of WaveformArea::RenderTrace, it says it must match COLS_PER_BLOCK in waveform-compute shader, but there is no COLS_PER_BLOCK in there, so I'm guessing this code is obsolete

18:15 <lain> looks like it's just calculating numGroups, which becomes the number of x invocations for the shader

18:15 <azonenberg> let me look, h/o

18:15 <azonenberg> that is for the *y* axis size iirc

18:15 <azonenberg> or wait

18:15 <azonenberg> oh ok. yeah

18:15 <azonenberg> it's mostly obsolete. we used to support two or more columns of pixels in one shader thread

18:16 <azonenberg> one shader group*

18:16 <azonenberg> also, here's an interesting finding

18:16 <lain> ah ok

18:16 <azonenberg> I set indexBuffer to {0}

18:16 <azonenberg> i.e. every thread starts rendering from x=0

18:17 <azonenberg> this is of course stupidly inefficient as we spend a ton of time transforming samples we'll never see

18:17 <azonenberg> but more interesting was the outcome

18:17 <d1b2> <azonenberg> https://cdn.discordapp.com/attachments/776941750291267595/1026920970927226910/index-zero.png

18:18 <lain> er, hm

18:18 <azonenberg> everything renders perfectly, then it stops right where it would have gone bad

18:18 <lain> isn't indexBuffer set in PrepareGeometry?

18:18 <azonenberg> Yes

18:18 <lain> oh I see you're saying you just set it to {0} as a test

18:18 <azonenberg> i patched out the BinarySearchForGequal call to return constant zero

18:18 <lain> fascinating o.o

18:18 * lain thinks

18:19 <azonenberg> i would have expected it to draw the entire waveform, just very slowly

18:19 <lain> indeed
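For readers unfamiliar with the search being patched out here, this is a minimal stand-in for a "first sample at or after X" lookup, assuming sorted int64 offsets. FirstIndexGequal is a hypothetical name and this is not the project's BinarySearchForGequal implementation, whose exact signature is not shown in the log.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in, not the libscopehal function.
// Returns the index of the first element >= target,
// or offsets.size() if every element is smaller.
// Precondition: offsets is sorted in ascending order.
std::size_t FirstIndexGequal(const std::vector<int64_t>& offsets, int64_t target)
{
    std::vector<int64_t>::const_iterator it =
        std::lower_bound(offsets.begin(), offsets.end(), target);
    return static_cast<std::size_t>(it - offsets.begin());
}
```

Forcing every lookup to return 0, as in the experiment above, is equivalent to calling this with a target at or below the first offset: each pixel column then starts walking from the first sample, which is slow but should still reach every sample.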

18:19 <azonenberg> fwiw, there is apparently the ability to printf or log messages from shader code in vulkan using the validation layers

18:19 <lain> that smells like a shader issue then

18:19 <azonenberg> i havent used it

18:19 <azonenberg> but it may be worth trying

18:20 <azonenberg> it seems that this is mutually exclusive with gpu assisted bounds checking etc

18:20 <azonenberg> but we dont seem to be going OOB here

18:20 <azonenberg> as none of those checks are firing

18:23 <lain> azonenberg: does your gpu have int64 support? wondering if I can ignore !HAS_INT64 issues

18:23 <lain> iirc my M1 GPU does not have int64, so it's using the more annoying path

18:24 <azonenberg> yes

18:24 <lain> kk

18:24 <azonenberg> your nvidia should too i think

18:24 <lain> seems likely

18:25 <lain> I'm testing on the M1 currently since it doesn't seem to make a difference which machine I test on, and the nvidia machine is back in WA and VNC is slow :P

18:27 <_whitenotifier-7> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±1] https://github.com/glscopeclient/scopehal/compare/3aeb074fe095...b10a70083a4d

18:27 <_whitenotifier-7> [scopehal] azonenberg b10a700 - Request VK_KHR_shader_non_semantic_info if available (required for shader based debug printf)

18:28 <azonenberg> ok so to get printf output from shaders, enable it in the vulkan config tool

18:28 <azonenberg> add #extension GL_EXT_debug_printf : require to the shader

18:28 <azonenberg> then call debugPrintfEXT() from the shader

18:28 <azonenberg> i need to get back to $dayjob stuff so bbl

18:28 <lain> kk

18:30 <azonenberg> and also pull latest (b10a700) scopehal

18:30 <azonenberg> which requests a device extension the printf requires

18:31 <lain> yeah, already did.. it's complaining of: program_source:88:9: error: use of undeclared identifier 'debugPrintfEXT'

18:31 <lain> currently debugging

18:31 <azonenberg> did you add that extension?

18:31 <lain> yep

18:31 <azonenberg> huh because i just did that on my end and it worked fine

18:31 <azonenberg> is your glslc too old maybe?

18:32 <lain> well this is happening at runtime

18:32 <lain> it compiles fine at build

18:32 <azonenberg> https://github.com/KhronosGroup/GLSL/blob/master/extensions/ext/GLSL_EXT_debug_printf.txt

18:32 <azonenberg> um

18:32 <azonenberg> at run time??

18:32 <azonenberg> i wonder if moltenvk doesnt implement that extension

18:32 <lain> hrm

18:33 <azonenberg> see if it says anything about VK_KHR_shader_non_semantic_info in the log

18:33 <azonenberg> or in vulkaninfo

18:33 <azonenberg> (device extension not instance extension)

18:34 <azonenberg> You might have to try this on the nvidia card. the nvidia should have it

18:34 <lain> hmm yeah I don't see it in there

18:34 <lain> alright

18:34 <azonenberg> yeah so its a moltenvk limitation then

18:48 <_whitenotifier-7> [scopehal] azonenberg commented on issue #688: "Unrecognized dataset type" when importing Tek .WFM file - https://github.com/glscopeclient/scopehal/issues/688#issuecomment-1267441628

18:48 <_whitenotifier-7> [scopehal] azonenberg closed issue #688: "Unrecognized dataset type" when importing Tek .WFM file - https://github.com/glscopeclient/scopehal/issues/688

19:00 <d1b2> <Mughees> Is Virtual machine supported for glscope?

19:01 <azonenberg> mughees: yes and no

19:01 <azonenberg> if you just throw together a VM, it probably won't work well/fast

19:02 <azonenberg> GPU acceleration in VMs tends to not be great

19:02 <azonenberg> with pcie passthrough of a dedicated GPU, it should work fine

19:02 <azonenberg> software emulation using swiftshader may be an option but we have not tested

19:02 <d1b2> <Mughees> alright...actually my hard disk gone bad..so had to switch to a windows laptop

19:02 <azonenberg> software emulation using llvmpipe will work out of the box eventually (although slow) but last time we tried we had some issues with it

19:02 <d1b2> <david.rysk> virgl may work

19:03 <d1b2> <david.rysk> If your host supports it

19:03 <d1b2> <Mughees> it does kind of work..but crashes while generating a sinewave

19:03 <azonenberg> glscopeclient does build/run on windows also, although the build is a bit of a mess right now

19:03 <d1b2> <david.rysk> IIRC it worked last time I tried, was just not so fast

19:03 <d1b2> <Mughees> In windows, are we able to edit code within MSYS2 environment?

19:04 <azonenberg> We're working on improving that. but the best path forward involves leaving GTK and moving entirely to ngscopeclient. which is several months from being at feature parity with glscopeclient

19:04 <azonenberg> msys2/mingw files are just normal files on your windows disk

19:04 <azonenberg> that you can edit with your text editor of choice

19:04 <d1b2> <Mughees> ok

19:04 <azonenberg> so you can edit the code there then compile in msys2

19:04 <d1b2> <Mughees> I think it is better to order a new hard 🙂

19:04 <azonenberg> We want to transition away from msys2 and just generate visual studio projects with cmake

19:05 <azonenberg> but while doing that with GTK is not impossible, it's a huge pain

19:05 <azonenberg> One of about a dozen reasons we're leaving GTK

19:11 <lain> ok there is definitely something weird in the shader

19:11 <lain> still debugging, but I'm seeing some nonsense values from the push constants

19:12 <azonenberg> struct packing issue?

19:13 <lain> quite possibly, hm

19:13 <lain> innerXoff=-1542624 window=4294967295x2073 memDepth=180 offset_samples=645832 alpha=0.000000 xoff=2.000000 xscale=0.000000 ybase=0.000113 yscale=90.000000 yoff=0.014734 persistScale=192.500000

19:14 <azonenberg> might also be 64 bit ints not handled by debugprintfext

19:14 <azonenberg> may need to explicitly print low/high halves or something?

19:14 <lain> ah true I should check the docs for the format specifiers

19:15 <lain> >No length modifiers. Everything except ul, lu, and lx is 32 bits, and ul and lx values are printed in hex

19:15 <azonenberg> is lu 64 bits?

19:15 <lain> it doesn't say, but that's my guess

19:16 <lain> but wouldn't lu be unsigned? whereas I have an int64_t

19:16 <lain> hm

19:16 <azonenberg> so there's no ld

19:16 <lain> seems not

19:16 <lain> well I can use %lx at least

19:16 <azonenberg> yeah

19:17 <lain> okay yeah it was just an issue with that
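The "print low/high halves" idea mentioned above can be sketched on the host side as follows. This is plain C++ for illustration only; it is not shader code and does not use debugPrintfEXT. The names Halves, SplitInt64 and FormatAsHexHalves are hypothetical.

```cpp
#include <cstdint>
#include <cstdio>
#include <string>

// Two 32-bit halves of a 64-bit value, each printable with a 32-bit format specifier
struct Halves
{
    uint32_t high;
    uint32_t low;
};

// Reinterpret the signed value as raw bits, then split it
Halves SplitInt64(int64_t value)
{
    uint64_t bits = static_cast<uint64_t>(value);
    Halves h;
    h.high = static_cast<uint32_t>(bits >> 32);
    h.low = static_cast<uint32_t>(bits & 0xffffffffu);
    return h;
}

// Format as two zero-padded hex words so the full 64-bit pattern is visible
std::string FormatAsHexHalves(int64_t value)
{
    Halves h = SplitInt64(value);
    char buf[32];
    std::snprintf(buf, sizeof(buf), "0x%08x_%08x",
        static_cast<unsigned>(h.high), static_cast<unsigned>(h.low));
    return std::string(buf);
}
```

A negative value such as -1 splits into two halves of 0xffffffff each, so a lone 32-bit print of either half of a small negative int64 would show a value near 4294967295.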

19:17 * lain back to hunting!

19:31 <lain> this may be a quirk of debugPrintfEXT, but I'm only ever seeing output from gl_GlobalInvocationID.x == 0, despite numGroups=1969

19:34 <lain> oh hm

19:37 <azonenberg> i got output from everything when i tried

19:37 <azonenberg> you do have to set the buffer size in the validation box

19:37 <azonenberg> it can be up to a megabyte

19:37 <lain> ohh there we go

19:37 <lain> thx

19:51 <bvernoux> it seems limit of GitHub are reached

19:51 <bvernoux> https://github.com/glscopeclient/scopehal-apps/actions/runs/3184281422/jobs/5192502382

19:51 <bvernoux> I suspect it is because there is not enough disk space ...

19:53 <azonenberg> Yes

19:53 <azonenberg> Yet another reason to move to our own local CI runners in the near future

20:03 <lain> hmm

20:08 <lain> oh, that's interesting

20:08 <lain> InterpolateY isn't using vec2 right at all

20:09 <azonenberg> ?

20:10 <lain> float InterpolateY(vec2 left, vec2 right, float slope, float x)

20:10 <lain> {

20:10 <lain> return left.y + ( (x - left.x) * slope );

20:10 <lain> }

20:10 <lain> so far I see no evidence of numerical instability, I'm beginning to wonder if this is a logic error

20:10 <azonenberg> very possible

20:11 <azonenberg> All i can say with certainty at this point is it's a bug :p

20:13 <azonenberg> try printing out the actual digital samples you process around x=500 or wherever the big gap is in the CDR trace

20:13 <azonenberg> look at start/end, see if you are taking an early-out too soon or being passed incorrect index data

20:18 <bvernoux> nice latest ngscopeclient rendering in demo is even faster

20:18 <bvernoux> I have about 58fps in full screen

20:18 <bvernoux> before it was slower, with framerate between 48fps and 56fps

20:21 <bvernoux> nice there is the cursor too

20:21 <bvernoux> which works fine

20:22 <bvernoux> and the menu is instant; even if so far there is only that option, it is night and day vs glscopeclient's right-click menu

20:25 <azonenberg> i mean there isnt much there. glscopeclient does a lot more work to set up the menu

20:25 <azonenberg> but we will definitely be caching heavily

20:26 <lain> azonenberg: time ticks are, what, femtoseconds?

20:27 <azonenberg> in a normal time domain waveform, yes

20:27 <azonenberg> in frequency domain, Hz

20:27 <azonenberg> although we will likely shift to mHz or uHz to provide better resolution in the future

20:29 <lain> hrm

20:34 <lain> not seeing anything weird with the voltage values, interesting...

20:35 * lain tests some ground truths

20:36 <azonenberg> are you looking at the cdr pll or what?

20:36 <azonenberg> the pll output is digital so fewer things to go wrong i think

20:37 <azonenberg> probably easier to test there

20:37 <lain> hm ok

20:37 <lain> I've been looking at the TIE measurement

20:38 <lain> oooh ok

20:38 <lain> getting somewhere now...

20:38 <azonenberg> oh?

20:38 <azonenberg> are you seeing some pixels that dont render any samples?

20:38 <azonenberg> because i'm very sure that's what is happening

20:38 <azonenberg> i just dont know *why*

20:38 <lain> yes

20:39 <lain> I replaced voltage[i] with a fixed value in the calculations for left and right in the main loop

20:39 <lain> I would expect to see a straight line

20:39 <azonenberg> And you don't?

20:39 <lain> it is a straight line but it has the missing pixels

20:40 <azonenberg> try printf'ing the total number of samples drawn in each column of pixels

20:40 <lain> I think you're right that it's an issue in the x calculation somewhere

20:40 <azonenberg> i bet you'll see some threads that do nothing

20:40 <lain> ahm

20:47 <lain> azonenberg: wait how would you calculate that in the shader?

20:48 <lain> looks like g_updating[gl_LocalInvocationID.y] gets set true if that pixel was updated, else false

20:49 <azonenberg> Yeah

20:49 <lain> ok yeah

20:49 <azonenberg> so what you'd do is add a bit of logic in thread y=0 of the block

20:49 <azonenberg> that sums g_updating each pass through the main loop

20:49 <azonenberg> into a local

20:49 <azonenberg> records a total count of how many pixels that thread wrote to

20:49 <azonenberg> then after the end of the loop, printf it

20:50 <azonenberg> (only in y=0)

20:50 <azonenberg> since you dont care which *threads* handled the samples

20:50 <azonenberg> you just care about the whole block drawing zilch

20:50 massi has quit [Remote host closed the connection]

20:53 <lain> ok I'm forgetting how this works, would I use the global or local invocation's y to check for == 0 ?

20:53 <lain> or I can just look it up in a sec

20:53 <lain> but brb, dr appt for a bit

20:55 <azonenberg> local

20:55 <azonenberg> local is thread index within the block

21:39 <lain> ok back

21:43 <lain> azonenberg: ok so I added a 'shared uint total_updated;', and then at the top of main(), if local y=0 I set total_updated = 0;...

21:44 <lain> near the end of the main loop there's an if(g_updating[y]) { ... }, and I increment total_updated in there, and then after the main loop, if this is y==0, I print the value

21:44 <lain> does that sound right?

21:45 <lain> and indeed I'm getting plenty of zeroes in the output

21:46 <lain> ok, so why aren't some threads doing anything... hm hmmmmmm!

21:49 <azonenberg> yeah that sounds about right. you might have some race conditions if you dont use atomic shareds or something

21:49 <azonenberg> but for this purpose it should be good enough

21:49 <azonenberg> actually total_updated doesnt even have to be shared

21:49 <azonenberg> you can just only write it in y=0

21:49 <lain> oh, true

21:50 <azonenberg> just make sure you check if g_updating[] is true for any Y

21:50 <azonenberg> anyway, point is you're getting lots of threads doing nothing, just as i suspected

21:50 <azonenberg> the why, i havent got a clue :p

21:51 <lain> so I just added a check

21:52 <lain> hmm

21:56 <lain> narrowing down what path it's taking...

21:58 <lain> ok it's hitting this path a bunch

21:58 <lain> //Skip offscreen samples

21:58 <lain> if( (right.x >= gl_GlobalInvocationID.x) && (left.x <= gl_GlobalInvocationID.x + 1) )

21:59 <lain> when that is false, the thread doesn't draw

21:59 <lain> and it seems to be false very often

22:00 <lain> wait

22:00 <lain> is that reversed lol

22:01 <azonenberg> Sounds fishy

22:01 <azonenberg> let's think

22:01 <azonenberg> we want the sample to start before the right edge of our pixel

22:01 <azonenberg> and end after the left edge

22:02 <azonenberg> so that sounds correct?
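
The reasoning above (the sample must start before the right edge of the pixel and end after its left edge) is an interval-overlap test. A minimal C++ mirror of the shader condition, assuming a hypothetical function name `SampleTouchesColumn` and treating `column` as the value of gl_GlobalInvocationID.x:

```cpp
#include <cstdint>

// Hypothetical host-side mirror of the shader's "skip offscreen samples" test.
// leftX / rightX are the pixel-space X coordinates of the segment's endpoints;
// column is the pixel column handled by the current thread.
bool SampleTouchesColumn(float leftX, float rightX, uint32_t column)
{
    float colLeft = static_cast<float>(column);
    float colRight = static_cast<float>(column + 1);

    // Segment ends at or after the column's left edge,
    // and starts at or before the column's right edge
    return (rightX >= colLeft) && (leftX <= colRight);
}
```

With this formulation a segment spanning many columns passes the test for each of them, which matches the conclusion in the chat that the comparison itself is not reversed.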

22:02 <azonenberg> is this with you using the proper index, or hardcoded to zero?

22:02 <lain> proper index

22:02 <azonenberg> Patch the index to zero to eliminate it as a variable

22:03 <azonenberg> the index should only be a speed optimization

22:03 <azonenberg> if it changes behavior - which it does - that implies we have a different bug

22:03 <azonenberg> let's focus on one bug at a time

22:05 <lain> ok so essentially replace xind[...] with 0 ?

22:06 <lain> err no that can't be right?

22:06 <azonenberg> I'd do it in preparegeometry to avoid making too many changes to the shader

22:06 <lain> it's only used in two places

22:06 <azonenberg> indexBuffer[j] = 0

22:06 <azonenberg> the index is just the starting point for the thread

22:07 bvernoux has quit [Quit: Leaving]

22:07 <lain> uint iend = xind[gl_GlobalInvocationID.x + 1];

22:07 <lain> if(iend <= 0)

22:07 <lain> g_done = true;

22:07 <lain> hrm

22:07 <lain> wouldn't this set g_done=true right away though?

22:08 <azonenberg> yeah i'm thinking

22:08 <azonenberg> ok my mistake

22:08 <azonenberg> I'm not sure what that check is for

22:08 <azonenberg> oh

22:08 <azonenberg> this is to skip drawing samples before the 0th

22:09 <lain> shouldn't it be < rather than <= ?

22:09 <azonenberg> We currently fill index=0 from the left pixel in the plot up to the first sample that actually contains waveform data

22:09 <azonenberg> it would probably be better to fill with a negative value and change to -

22:09 <azonenberg> change to < 0

22:10 <azonenberg> see WaveformArea::BinarySearchForGequal()

22:10 <azonenberg> if we're searching for a value before the start of the waveform it returns 0
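
To illustrate why an index of 0 is ambiguous here, the following is a rough sketch of a "first element greater than or equal to target" search. It is not the real WaveformArea::BinarySearchForGequal(): the name `FirstIndexGequal`, the use of std::lower_bound, and the sorted vector of sample offsets are all assumptions made for the example.

```cpp
#include <algorithm>
#include <cstddef>
#include <cstdint>
#include <vector>

// Hypothetical stand-in for a "binary search for >=" helper.
// offsets must be sorted ascending. Returns the index of the first element
// that is >= target, or offsets.size() if there is none.
size_t FirstIndexGequal(const std::vector<int64_t>& offsets, int64_t target)
{
    auto it = std::lower_bound(offsets.begin(), offsets.end(), target);
    return static_cast<size_t>(it - offsets.begin());
}
```

In this sketch a target before the start of the waveform and a target exactly at the first sample both return 0, so a consumer cannot tell "no data yet" apart from "start at sample 0" without a separate sentinel, such as the negative fill value suggested above.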

22:11 <azonenberg> (feel free to comment the renderer more aggressively, this is some of the heavily optimized arcane black magic that makes glscopeclient tick lol)

22:12 <lain> haha

22:20 <lain> hmm

22:21 <lain> azonenberg: https://dt.lain.land/Screen%20Shot%202022-10-04%20at%2016.20.31.png

22:22 <azonenberg> yep. giant gaps that shouldnt be there

22:22 <lain> this is such a weird bug lol

22:22 <lain> ok

22:23 <lain> so I'm going to patch voltage to 0 and indexes to 0

22:23 <azonenberg> and disable that early out for iend <= 0?

22:23 <lain> yeah

22:26 <lain> ok that's interesting

22:26 <azonenberg> ?

22:26 <lain> I haven't patched voltage=0 yet, but with index=0 and getting rid of that early out on iend <= 0, it seems to be working as expected

22:26 <lain> ok so there's something wrong with the logic there

22:27 <lain> or a larger problem lurking somewhere :P

22:27 <azonenberg> lol

22:28 <azonenberg> what happens if you *just* get rid of the early out

22:28 <azonenberg> and keep the real indexes

22:28 <lain> testing

22:28 <lain> I suspect this will work, I think the early out is the reason some threads were being lazy

22:28 <lain> yeah, this works fine

22:29 <lain> ok so it's that early out.

22:29 <azonenberg> dump the indexes?

22:29 <lain> sure

22:29 <azonenberg> maybe they're corrupted somehow

22:29 <azonenberg> They should be monotonic left to right with no 0s

22:29 <lain> I'll dump them from the shader and c++ to be extra sure

22:29 <azonenberg> all 0 from start to first sample, then increasing

22:29 <azonenberg> ok

22:31 <lain> OH

22:32 <lain> ohno.

22:32 <lain> the ... ok.

22:32 <lain> AcceleratorBuffer<int64_t> m_indexBuffer;

22:32 <lain> vs

22:32 <lain> uint xind[];

22:33 <azonenberg> um

22:33 <lain> that doesn't need to be an int64...

22:33 <azonenberg> that's no good lol

22:33 <azonenberg> So actually

22:33 <azonenberg> the fix should be on the host side

22:33 <azonenberg> using 32 bit indexes

22:33 <lain> yeah

22:33 <azonenberg> Because Vulkan has a 4GB memory allocation cap

22:33 <lain> ffs.

22:33 <azonenberg> so we can't have a waveform >1G point without major refactoring to split them into multiple buffers
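
One plausible reading of the mismatch, sketched in C++: if the host fills a buffer of int64_t indexes and the shader reads the same bytes as an array of 32-bit uints, every second element the shader sees is the upper half of an index. The helper name `ReadAsUint32` is hypothetical, and the layout shown assumes a little-endian host and small non-negative index values.

```cpp
#include <cstdint>
#include <cstring>
#include <vector>

// Hypothetical helper: view a buffer of 64-bit indexes the way a consumer
// declaring `uint xind[]` would, i.e. as consecutive 32-bit words.
std::vector<uint32_t> ReadAsUint32(const std::vector<int64_t>& host)
{
    std::vector<uint32_t> words(host.size() * 2);
    std::memcpy(words.data(), host.data(), host.size() * sizeof(int64_t));
    return words;
}
```

Under those assumptions, host indexes {5, 9, 12} are read back as {5, 0, 9, 0, 12, 0}. The interleaved zeros would trip an early-out of the form `iend <= 0` for many columns, which is consistent with the threads that drew nothing.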

22:34 <azonenberg> Sooo was this a refactoring regression?

22:34 <lain> I think so, I'm gonna go check when I introduced that and what it replaced to be sure, but I think so.

22:35 <azonenberg> anyway, good find

22:35 <azonenberg> awaiting PR once you confirm the fix :)

22:37 <lain> yep there it is

22:37 <lain> uint32_t*m_mappedIndexBuffer;

22:37 <lain> that was the original decl

22:37 <lain> and when moving it to AcceleratorBuffer I copypasta failed

22:37 <azonenberg> good luck finding that in a bisect lol

22:37 <lain> lol yeah jeez

22:37 <lain> welp.

22:38 * lain discards her debug changes, prepares a PR

22:39 <lain> actually it's a single commit, one-line change, mind if I just push to master?

22:39 <_whitenotifier-7> [scopehal-apps] azonenberg pushed 1 commit to master [+3/-0/±5] https://github.com/glscopeclient/scopehal-apps/compare/4250efe5e717...392e07afc132

22:39 <_whitenotifier-7> [scopehal-apps] azonenberg 392e07a - Initial work on font manager. Not yet integrated with preferences.

22:39 <azonenberg> Go for it

22:41 <_whitenotifier-7> [scopehal-apps] lainy pushed 1 commit to master [+0/-0/±1] https://github.com/glscopeclient/scopehal-apps/compare/392e07afc132...02b3c7f386b9

22:41 <_whitenotifier-7> [scopehal-apps] lain 02b3c7f - Fix regression for sparse waveform rendering.

22:55 <azonenberg> lain: fix confirmed

22:56 <azonenberg> thanks. back to macos stuff now? you said your branch is ready to check out and merge?

23:00 <lain> yeah I'm just merging master again right now as there's some changes I need to fix up

23:07 <lain> hrm

23:08 <lain> azonenberg: ok so TextureManager and Texture need access to a queue and cmdbuf under the new queue manager model. iirc the existing code just uses globals. is it safe to have them use the global transfer queue, or does it *need* to be the same render queue used by the main window?

23:09 <azonenberg> The global transfer queue needs to go away

23:09 <azonenberg> Because we need to support operating on intel iGPUs that have only one queue

23:09 <lain> alternatively I can have TextureManager grab a queue of its own, make a command pool, and a command buffer... but then locking access to the cmdbuf is mildly annoying

23:09 <azonenberg> ok let me rephrase

23:09 <azonenberg> we cannot dedicate a queue to transfer permanently

23:09 <azonenberg> so it will have to be a QueueHandle

23:09 <lain> yes

23:10 <azonenberg> Using the global transfer queue should be fine

23:10 <lain> so in my branch, g_vkTransferQueue is a QueueHandle

23:10 <azonenberg> Where it gets challenging is the sharing aspect

23:10 <lain> yeah, avoiding deadlocks

23:10 <azonenberg> where you have to declare it as being shareable to any queue it might be rendered on

23:10 <lain> ohh

23:10 <lain> that sharing aspect

23:10 <azonenberg> yeah

23:10 <lain> hm yeah...

23:11 <azonenberg> So you need to have some way for QueueManager to give you a list of any queue that could possibly be used for rendering, and any queue that could possibly be g_vkTransferQueue

23:11 <azonenberg> and share it with all of them

23:11 <lain> bluh :P

23:11 <azonenberg> If we can statically allocate queues on systems with more queues, and eliminate locking, that will be good

23:11 <azonenberg> but we can't give up functionality on lower end hardware outright

23:11 <lain> or I could have MainWindow pass its render QueueHandle to TextureManager when it instantiates it

23:12 <azonenberg> But the render QueueHandle isn't guaranteed to be the same underlying queue object every time you lock it, right?

23:12 <lain> it is yes

23:12 <azonenberg> ah, ok

23:12 <azonenberg> in that case that's likely the simplest solution

23:12 <azonenberg> yay concurrency :D

23:12 <lain> the only annoying part there is I need to expose a mutex from TextureManager in case multiple Texture objects are being created at the same time (and thus all trying to hit TextureManager::m_cmdBuf at the same time)

23:13 <lain> hm

23:13 <azonenberg> You'd need one anyway, no?

23:14 <lain> yeah

23:14 <azonenberg> because TextureManager has the list of textures that isnt thread safe

23:14 <lain> ah, true

23:14 <azonenberg> But i'm not sure if we actually ever allocate textures outside the main thread

23:14 <azonenberg> i think we only ever use textures for tone mapping (in the main thread) and when loading toolbar icons

23:14 <lain> I'll just make a note

23:14 <azonenberg> Yeah i think it's safe to not lock it

23:14 <azonenberg> we do tons of stuff in background threads and on other queues but it's all compute shaders

23:21 <lain> hmm

23:21 <lain> my branch is a lil crashy, I'll want to fix that before we attempt a merge

23:21 <lain> looks like that's my agenda for tomorrow

23:22 <azonenberg> ok. i'm working on gui preferences and font handling in ngscopeclient

23:22 <azonenberg> shouldn't be conflicting

23:22 <lain> I think it's just an order of operations issue with resource release, will confirm tomorrow

23:23 <lain> oo, also some fence issues around command buffers, I can see how that would slip by with how I've implemented queue manager.. well, should be a pretty easy fix

23:57 bgamari has quit [Quit: ZNC 1.8.2 - https://znc.in]