#scopehal on 2024-09-09 — irc logs at libera.irclog.whitequark.org

2023-10-21 05:40 azonenberg changed the topic of #scopehal to: ngscopeclient, libscopehal, and libscopeprotocols development and testing | https://github.com/ngscopeclient/scopehal-apps | Logs: https://libera.irclog.whitequark.org/scopehal

02:41 Degi has quit [Ping timeout: 252 seconds]

02:42 Degi has joined #scopehal

04:39 <_whitenotifier-1> [scopehal] azonenberg pushed 3 commits to master [+0/-0/±3] https://github.com/ngscopeclient/scopehal/compare/84c8c8099591...98ffae9c1e30

04:39 <_whitenotifier-1> [scopehal] azonenberg 6a6eb86 - XYSweepFilter: added gating input

04:39 <_whitenotifier-1> [scopehal] azonenberg 8808871 - PkPkMeasurement: fixed incorrect units on scalar outputs

04:39 <_whitenotifier-1> [scopehal] azonenberg 98ffae9 - ScalarStairstepFilter: improved error handling

04:39 <_whitenotifier-1> [scopehal-apps] azonenberg pushed 1 commit to master [+0/-0/±2] https://github.com/ngscopeclient/scopehal-apps/compare/2dfa83189a62...98536665c0b7

04:39 <_whitenotifier-1> [scopehal-apps] azonenberg 9853666 - WaveformArea: show help message on secondary trigger level arrow too

05:03 <_whitenotifier-1> [scopehal] azonenberg pushed 1 commit to master [+0/-0/±2] https://github.com/ngscopeclient/scopehal/compare/98ffae9c1e30...82c4369a1d97

05:03 <_whitenotifier-1> [scopehal] azonenberg 82c4369 - ScalarStairstepFilter: added restart command

10:58 <d1b2> <0xdiansheng> Thanks!

12:41 <_whitenotifier-1> [scopehal-sigrok-bridge] MorganTL forked the repository - https://github.com/MorganTL

17:27 <_whitenotifier-1> [scopehal] fredzo opened pull request #886: Dho support fixes - https://github.com/ngscopeclient/scopehal/pull/886

17:32 <d1b2> <fredzo_72653> @azonenberg 🔼 PR to fix time-base settings from ngscopeclient on DHO models. Its working nicely on my DHO900 but will need testing on DHO1000/4000. @josHua if you can take a look with your DHO4000: is kind of guessed the samplerate et memory depth values for this model... Also on the DHO800/900 the timebase factor needed to calculate the horizontal scale from srate and mdepth is not constant (varries from 10 to 20 depending on the

17:32 <d1b2> srate...). Maybe you'll have to check if it's the same on the DHO4000.

17:34 <d1b2> <azonenberg> wrt horizontal scale, my guess is this has to do with whether interleaving is active

17:34 <d1b2> <azonenberg> i.e. are you sampling with one or two ADCs per channel

17:34 <d1b2> <fredzo_72653> Does not seem so: it varies event if only one channel is active

17:35 <d1b2> <azonenberg> huh

17:35 <d1b2> <azonenberg> interesting

17:35 <d1b2> <azonenberg> does it only turn on at really high sample rates?

17:35 <d1b2> <azonenberg> i.e. its possible they auto enable interleaving with one channel but only at the highest sample rates

17:35 <d1b2> <fredzo_72653> I had to create an excel sheet to understand what was happening!...

17:36 <d1b2> <azonenberg> lol

17:37 <d1b2> <fredzo_72653> https://cdn.discordapp.com/attachments/776941750291267595/1282756748691767418/image.png?ex=66e083d1&is=66df3251&hm=87c20915908d4d288fb51c45b3cfa9f9a94cce3c77dacce23d79516a1194abda&

17:38 <d1b2> <azonenberg> That feels like some kind of clock divider setting

17:40 <d1b2> <fredzo_72653> Yes that's what I thought... Srate values are quite odd on this model : starts at 1.25Gsa/s and gets divided but then drops to 156.250 Msa/s

17:41 <d1b2> <azonenberg> That's not odd at all

17:42 <d1b2> <fredzo_72653> right the next one is odd 😉

17:42 <d1b2> <azonenberg> at least, not at someone who spends a lot of time in the SERDES world

17:42 <d1b2> <azonenberg> 156.25 is 1250 / 8

17:42 <d1b2> <azonenberg> it's a very common clock rate for 1.25 Gbps SERDES with an 8-bit datapath, or a reference clock for a 1.25 Gbps SERDES PLL

17:43 <d1b2> <azonenberg> also 125 and 312.5 (divided by 4 and 10 respectively)

17:43 <d1b2> <azonenberg> or 625 for divide by two if you're just using DDR flipflops

17:43 <d1b2> <azonenberg> you see these frequencies in 10GbE too for the same reason

17:43 <d1b2> <azonenberg> 312.5 MHz is 10 Gbps / 32

17:44 <d1b2> <azonenberg> (although depending on how the clocking is structured since the line rate is 10.3125 Gbps, you sometimes see 322.265625 MHz or multiples/divisions of it)

17:44 <d1b2> <azonenberg> so basically it seems like they have a 1.25 Gsps ADC and divide down by integers from there

17:45 <d1b2> <fredzo_72653> Yes I guess there's a logic explanation for these values, I just don't see it from here 🙂

17:45 <d1b2> <azonenberg> Yeah, i just recognized integer divisions of 1.25 GHz instantly because it's such a common clock frequency

17:45 <d1b2> <azonenberg> especially for ethernet

17:46 <d1b2> <fredzo_72653> Andrew if I may take some more of your time, I have seg faults when using EyePattern with the DHO driver

17:47 <d1b2> <fredzo_72653> Thread 12 received signal SIGSEGV, Segmentation fault. [Switching to Thread 40216.0xa6a4] 0x00007fff63d709cb in EyePattern::DensePackedInnerLoop (this=this@entry=0x14d15010, waveform=0x14bc7630, clock_edges=std::vector of length 2218, capacity 4096 = {...}, data=data@entry=0x1dbd6040, wend=wend@entry=99999, cend=cend@entry=2217, xmax=xmax@entry=2047, ymax=ymax@entry=1023, xtimescale=xtimescale@entry=707788800,

17:47 <d1b2> yscale=yscale@entry=1280, yoff=yoff@entry=320) at D:/msys64/home/fborry/ngscopeclient/scopehal-apps/lib/scopeprotocols/EyePattern.cpp:558 558 pix[0] += 64 - bin2;

17:47 <d1b2> <azonenberg> Likely unrelated, i.e. its a driver crash or an eye filter crash

17:47 <d1b2> <azonenberg> i cant imagine a filter crash being specific to a driver

17:48 <d1b2> <azonenberg> Hmm. Can you save a scopesession with the offending waveform, clock. and data but the clock not supplied to the eye filter?

17:48 <d1b2> <azonenberg> and if so, does it crash reliably when you connect the clock?

17:48 <d1b2> <azonenberg> if you can get that, i'll try and debug on my end

17:49 <d1b2> <fredzo_72653> No it crashes after a few tenth of seconds of acquisition

17:49 <d1b2> <azonenberg> yeah but you have to hae set up the decode fully if it's reaching that point

17:49 <d1b2> <azonenberg> i'm talking about capturing waveforms first, then setting up the eye filter without connecting the clock

17:49 <d1b2> <azonenberg> so it shouldnt actually run th efilter

17:50 <d1b2> <azonenberg> then set up whatever filters you're using to generate the clock

17:50 <d1b2> <azonenberg> at this point you should have no crashes since the filter isn't being evaluated

17:50 <d1b2> <fredzo_72653> The clock is UART

17:50 <d1b2> <azonenberg> you're using the uart cdr filter? ok

17:51 <d1b2> <fredzo_72653> Uart Clock recovery filter

17:51 <d1b2> <azonenberg> anyway if you can acquire data first, save it, set up as much of the filter graph as possible before it crashes., saving after making each connection

17:51 <d1b2> <azonenberg> and tell me what the missing link is to make it crash

17:51 <d1b2> <azonenberg> then i'll see what happens

17:51 <d1b2> <fredzo_72653> OK just a sec

17:53 <d1b2> <fredzo_72653> At this point, no crash :

17:53 <d1b2> <fredzo_72653> https://cdn.discordapp.com/attachments/776941750291267595/1282760922263453716/image.png?ex=66e087b4&is=66df3634&hm=15902c0039d8420d53ecbc4ab6eeb75199ace76bdeb43f4d990bd91d9d474b91&

17:54 <d1b2> <azonenberg> ok. can you save the session in that state?

17:54 <d1b2> <azonenberg> and send me the session (make sure to include the _data directory as that's where the waveforms live)

17:54 <d1b2> <azonenberg> i'll load offline and see if i see anything wrong with it

17:55 <d1b2> <azonenberg> My guess is the uart cdr filter is doing something the eye filter doesn't like

17:55 <d1b2> <azonenberg> the eye filter is supposed to be bounds checking all of the clocks to make sure they're valid and preventing it from scribbling over offscreen pixel locations but i'm guessing something is buggy there

17:55 <d1b2> <fredzo_72653> Now it's running OK with the complete setup :

17:55 <d1b2> <fredzo_72653> https://cdn.discordapp.com/attachments/776941750291267595/1282761413961453638/image.png?ex=66e08829&is=66df36a9&hm=83b0481e993675d19bbd21aa79c12559d6d5f275bb55e5a7ea7f00bdd31f8fad&

17:56 <d1b2> <fredzo_72653> But it will crash eventually...

17:56 <d1b2> <fredzo_72653> Crashed 😕 After about 50sec

17:57 <d1b2> <azonenberg> That sounds like intermittent memory corruption where sometimes it hits an improtant value and sometimes it doesn't

17:57 <d1b2> <azonenberg> If you build with sanitizers, does that give any indications of what's going on?

17:58 <d1b2> <azonenberg> iirc it's cmake -DSANITIZE=ON -DCMAKE_BUILD_TYPE=DEBUG

17:58 <d1b2> <azonenberg> if it is memory corruption we should see complaints there

17:58 <d1b2> <azonenberg> (the question will then be why the checks intended to prevent this failed to do so)

17:59 <d1b2> <azonenberg> anyway, odds are there's out of bounds accesses even before it crashes they just don't give complete garbage

17:59 <d1b2> <azonenberg> so it's something we'll probably catch

17:59 <d1b2> <azonenberg> Send me whatever data you manage to get before crashing

18:00 <d1b2> <fredzo_72653> here you go

18:00 <d1b2> <fredzo_72653> https://cdn.discordapp.com/attachments/776941750291267595/1282762582989733929/uart-eye-pattern-crash.zip?ex=66e08940&is=66df37c0&hm=1baa51ae0a3c870cb05a3eb59da6af8baf9a6effa6cb8458808da3b59256a5be&

18:00 <d1b2> <azonenberg> Ok will have a look in a bit, about to make lunch

18:01 <d1b2> <fredzo_72653> OK thx, have a nice lunch, it dinner time here 🙂

18:42 <d1b2> <josHua> I will look in a moment. Did you back out my anti-fix for yoffset in there? If not, you should.

20:43 <d1b2> <fredzo_72653> xoffset you mean ? did not touch that no, it seamed to work for me

20:44 <d1b2> <azonenberg> @fredzo_72653 so the issue is the mismatch between how the driver behaves and how all other drivers do

20:44 <d1b2> <azonenberg> in that T=0 should be the start of the acquisition, not the trigger timestamp

20:45 <d1b2> <fredzo_72653> Oh that's what causes the segfault ?

20:46 <d1b2> <fredzo_72653> Or do you mean josHua's "anti-fix" ?

20:46 <d1b2> <azonenberg> i mean the anti-fix

20:46 <d1b2> <azonenberg> this is unrelated to the crash which i havent had a chance to look at

20:47 <d1b2> <azonenberg> it just causes the gui to act differently with this one scope model than every other

20:47 <d1b2> <azonenberg> which is obviously bad for what's supposed to be a universal tool 🙂

20:48 <d1b2> <fredzo_72653> OK got it

20:49 <d1b2> <fredzo_72653> Yes I figured that trigger point was à T=0 which is not consistent with what you said it should be

20:49 <d1b2> <fredzo_72653> I'll take a look

20:49 <d1b2> <fredzo_72653> to fix that

20:50 <d1b2> <azonenberg> Yeah. We specifically measure time differently from how a lot of scopes do (trigger=0) as this works better with e.g. multiple instruments

20:51 <d1b2> <azonenberg> if you can add that change to the current PR i'll review once you get that done

20:51 <d1b2> <fredzo_72653> OK no problem

20:55 <d1b2> <fredzo_72653> I'm out of luck for the sanitizer, it does not seem to be available on windows 😕

20:58 <d1b2> <azonenberg> ah ok i'll see what i can find on my end then with your dump when i have time