#jruby on 2022-12-20 — irc logs at libera.irclog.whitequark.org

2022-01-19 17:17 ChanServ changed the topic of #jruby to: Get 9.3.3.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

03:01 razetime has joined #jruby

03:48 subbu has joined #jruby

06:11 subbu has quit [Ping timeout: 268 seconds]

13:32 razetime has quit [Quit: https://quassel-irc.org - Chat comfortably. Anywhere.]

14:28 subbu has joined #jruby

16:26 subbu has quit [Ping timeout: 252 seconds]

16:31 subbu has joined #jruby

18:26 IlyaBylich[m] has joined #jruby

18:27 <IlyaBylich[m]> Hello there

18:41 <headius> Hiya!

18:41 <headius> I just replied about the naming changes, good call, I was just guessing at names

18:42 <IlyaBylich[m]> Well, then it was a very precise guess from your side :)

18:47 <headius> I can get those changes in shortly

18:54 <enebo[m]> Ilya Bylich: Is 7.0.0.9 the last version of Ragel to support -R?

18:54 <enebo[m]> I have 7.0.0.12 and it is missing

18:56 <headius> Yeah looks like Ruby support has been dropped. That doesn't bode well for using ragel going forward

18:57 <headius> Not that I have a suggestion to replace it of course

18:57 <IlyaBylich[m]> Right, it's been dropped. IIRC Ragel is no longer actively developed, but before it was abandoned the author of Ragel announced that Ruby support would be added back

18:57 <IlyaBylich[m]> So 6.10 is the latest version that we can use

18:58 <enebo[m]> ok

18:59 <IlyaBylich[m]> What are your thoughts on using ivars for state management? Is it that slow compared to lvars?

19:00 <IlyaBylich[m]> Because I haven't noticed any difference on MRI (3.0, without new object shapes)

19:01 <headius> It's likely the overhead is massively overshadowed by other stuff in this lexer

19:02 <headius> At a minimum, even with the best optimizations on MRI, there's still going to be cache validation and the memory access

19:02 <IlyaBylich[m]> Right, at least one level of indirection

19:02 <enebo[m]> It seems too bad this does not have an option to generate a state table ala array

19:02 <headius> It might show up more with YJIT or on JRuby and TR since the register will be much faster than any memory access

19:03 <headius> Without a JIT it's either going to the stack or going to an object in the CPU cache which is why you probably don't see much difference

19:04 <IlyaBylich[m]> enebo: Ragel uses jump tables, at least I see some huge arrays in compiled output

19:04 <headius> If moving some of this stuff to instance variables means we can get this method down to a JIT-able size, that benefit probably will outweigh the overhead

19:04 <enebo[m]> I should probably look at the generated files again. It has been a while

19:04 <enebo[m]> I largely just remember the massive case/when

19:05 <IlyaBylich[m]> Yes, I also think that it's worth it. JITs will be everywhere soon and I'm actually sure that YJIT is able to properly compile it
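
The ivar-versus-lvar question above can be checked with a small benchmark. This is only a minimal sketch: `count_with_lvars`, `IvarCounter`, and `compare_state_storage` are hypothetical names standing in for the lexer's state handling, not code from the parser gem, and the relative timings will depend on the Ruby implementation and on whether a JIT is active.

```ruby
require "benchmark"

# Hypothetical stand-in for lexer state kept in local variables.
def count_with_lvars(n)
  pos = 0
  total = 0
  while pos < n
    total += pos
    pos += 1
  end
  total
end

# The same loop, but every piece of state lives in an instance variable,
# so each read and write goes through the object instead of a local slot.
class IvarCounter
  def run(n)
    @pos = 0
    @total = 0
    while @pos < n
      @total += @pos
      @pos += 1
    end
    @total
  end
end

# Times both variants over the same amount of work.
def compare_state_storage(n = 1_000_000)
  Benchmark.bm(6) do |bm|
    bm.report("lvars") { count_with_lvars(n) }
    bm.report("ivars") { IvarCounter.new.run(n) }
  end
end

compare_state_storage
```

Running the same script under MRI, YJIT, JRuby, and TruffleRuby would show whether the gap widens once a JIT can keep locals in registers, as suggested above.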

19:06 <IlyaBylich[m]> enebo[m]: You are right, there's a big case/when, but it's used for actions (i.e. callbacks of transitions), there's a jump-table-based state machine under the hood for transitions.

19:06 <enebo[m]> ah ok

19:07 <enebo[m]> So it uses the table to lookup what actions to trigger and those actions are found in the case

19:07 <IlyaBylich[m]> Racc for example emits every "action" as a method with `_N` suffix that is dynamically dispatched, Bison technically is able to do the same with some magic, but Ragel is the worst of them :D

19:08 <IlyaBylich[m]> enebo[m]: correct
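
The split described above (a table decides the transition and which action to fire, and a `case` holds the action bodies) can be sketched as follows. This is a hand-written, heavily simplified illustration, not Ragel's real output format: `TRANSITIONS`, `classify`, and `scan_numbers` are hypothetical names, and a hash is used where Ragel emits flat arrays.

```ruby
# Hypothetical transition table: state -> input class -> [next_state, action_id].
# Action 0 means "no action".
TRANSITIONS = {
  start:  { digit: [:number, 1], space: [:start, 0], other: [:start, 0] },
  number: { digit: [:number, 1], space: [:start, 2], other: [:start, 2] },
}.freeze

# Maps a character to the input class used as a table key.
def classify(ch)
  case ch
  when "0".."9" then :digit
  when " " then :space
  else :other
  end
end

# Extracts runs of digits from the input as integers.
def scan_numbers(input)
  state = :start
  buffer = +""
  tokens = []
  # A trailing space acts as a sentinel so the last number is flushed.
  (input + " ").each_char do |ch|
    state, action = TRANSITIONS.fetch(state).fetch(classify(ch))
    # The action bodies live in one case/when, keyed by the id from the table.
    case action
    when 1 then buffer << ch
    when 2
      tokens << buffer.to_i
      buffer = +""
    end
  end
  tokens
end

p scan_numbers("12 abc 7")
```

In a real generated lexer the `case` has one branch per action, which is why the enclosing method grows so large.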

19:11 <IlyaBylich[m]> I'll try to apply the approach mentioned in https://github.com/whitequark/parser/pull/899#issuecomment-1359942610 tomorrow to see the impact on MRI. If it's not terrible in interpreter mode I'll check it with JRuby/TruffleRuby/YJIT, and if JITed code runs significantly faster I'll probably merge it

20:01 <headius> ok I'm back

20:02 <headius> Ilya Bylich: FWIW even with F0 we are very far off from being able to JIT this, and that node limit on TR is extremely high so I think it will be a while before either of us JIT this code

20:03 <headius> even YJIT might have trouble, I'm not sure, but even if it doesn't it's creating such a massive piece of native code it might not even fit in memory caches

20:03 <headius> I will fix those method names now

20:08 <headius> Ilya Bylich: PR is all set, though I don't have permission to run workflows

20:33 subbu has quit [Ping timeout: 260 seconds]

20:36 <headius> ok I'm done with parser stuff for now

20:39 <headius> BuildRangeInstr

20:39 <headius> range isn't even an operand now... we could make a Range operand for pure literal ranges

20:40 <enebo[m]> I believe one problem has to do with it raising on initialize

20:40 <headius> for pure literals we could make that determination at compile time

20:40 <enebo[m]> it cannot raise exception as an operand

20:40 <enebo[m]> yeah that's true

20:40 <enebo[m]> just so we know

20:40 <headius> and I'm not sure what cases could be pure literal but also fail at runtime

20:41 <headius> there may not be any

20:41 <enebo[m]> yeah I just recall there being a bunch of weird specs fixed for 3 but most were things like endless or beginless ranges

20:42 <enebo[m]> they also must be matched literals but I assume you meant that

20:42 <headius> hmm ok we actually do have a Range operand

20:42 <enebo[m]> huzzah

20:42 <headius> it just doesn't cache anything

20:43 <headius> hmm or it's not being used for a simple range like (1..2)?

20:43 <headius> investigating

20:43 <enebo[m]> heh...I am back several hundred commits and I do not see Range as an operand

20:44 <headius> ha

20:44 <enebo[m]> yay

20:44 <headius> org.jruby.compiler.NotCompilableException: no visitor logic for org.jruby.ir.operands.Range in org.jruby.ir.persistence.IRDumper

20:44 <enebo[m]> you added this last a little over a year ago

20:45 <enebo[m]> I wonder what you missed

20:45 <headius> how do you see that? my log doesn't show any real changes since 2015

20:45 <enebo[m]> c88ae57

20:45 <enebo[m]> I see stuff in JVMVisitor

20:46 <enebo[m]> yeah I see a Range() in JVMVisitor and a method added to IRBVisitor

20:47 <headius> ok weird intellij is not giving me the right log

20:47 <enebo[m]> HAHA IRBVisitor

20:47 <enebo[m]> I hope I never see that class

20:48 <enebo[m]> anyways this is another example of us having the same conversation

20:48 <enebo[m]> it is kind of funny how much we must enforce each other without realizing it

20:48 <headius> ok I have no idea what history intellij is showing me but I see the commits from last year

20:48 <enebo[m]> err reinforce

20:48 <enebo[m]> yeah it seems to be there using valuecompiler to push range

20:49 <enebo[m]> so that should be caching right?

20:49 <headius> it doesn't seem to be

20:49 <headius> $ jruby -e 'p((1..2).equal?((1..2)))'

20:49 <headius> false

20:49 <headius> oh but it's probably not using the same store for both

20:49 <enebo[m]> oh hahah yeah

20:49 <headius> it could in JIT but probably not in IR

20:49 <enebo[m]> no dedup

20:49 <headius> right

20:50 <enebo[m]> we maybe need some generic deduper

20:50 <enebo[m]> we have enough different types which do this

20:51 <enebo[m]> Almost also makes me wish we just had an annotation on Operand @Deduplicated which magically did it

20:51 <headius> $ jruby -e 'ary = []; 2.times { ary << (1..2) }; p ary[0].equal?(ary[1])'

20:51 <headius> true

20:51 <enebo[m]> (and by magically I mean we would implement that magic)

20:51 <headius> yeah that's possible but it would need to store it somewhere

20:51 <headius> strings have a specific place

20:51 <headius> true

20:51 <headius> $ ruby -e 'ary = []; 2.times { ary << (1..2) }; p ary[0].equal?(ary[1])'

20:51 <enebo[m]> yeah I just wonder if we can Object these things

20:51 <headius> CRuby

20:51 <enebo[m]> HAHAH

20:51 <headius> so we both do this already but no dedup

20:52 <headius> ok confusion gone

20:52 <headius> whew

20:52 <enebo[m]> Each literal is its own instance and I am not sure we need to dedup ranges

20:52 <enebo[m]> they are small and there are not many of them

20:53 <headius> yeah it's not a big deal

20:53 <headius> caching is sufficient

20:53 <enebo[m]> caching per site definitely makes sense
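
The difference between per-site caching and deduplication discussed above can be reproduced in one script. This is a small sketch; `ranges_from_one_site` and `ranges_from_two_sites` are hypothetical helper names. According to the one-liners in the log, evaluating the same literal site twice yields the same object on both JRuby and CRuby, while two separate `(1..2)` literals were distinct objects on JRuby.

```ruby
# Evaluates a single (1..2) literal site twice.
def ranges_from_one_site
  ary = []
  2.times { ary << (1..2) }
  ary
end

# Evaluates two separate (1..2) literal sites once each.
def ranges_from_two_sites
  [(1..2), (1..2)]
end

a, b = ranges_from_one_site
c, d = ranges_from_two_sites

puts "same site, same object?       #{a.equal?(b)}"
puts "different sites, same object? #{c.equal?(d)}"
puts "different sites, equal value? #{c == d}"
```

`equal?` compares object identity and `==` compares the range bounds, so the last line shows that missing deduplication has no effect on value equality.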

20:53 <enebo[m]> So I think I just realized why the mocha thing stopped working at the bisect commit I found

20:54 <enebo[m]> it updated Ruby 3 stdlib and gems to default and I looked at those and thought...no way these are causing any issues with backtraces

20:54 <enebo[m]> I just noticed this was the same commit which also bumped the language version

20:54 <headius> aha

20:54 <enebo[m]> So now I have to see which code path is version guarded in mocha or minitest or rake

20:55 <enebo[m]> The bug is maddening

20:55 <enebo[m]> it takes a block in a test and then adds that block via define_method into a class which extends minitest+mochatest then makes an instance and runs that test

20:56 <enebo[m]> our backtrace shows 'test_me' which is some dummy name they give this method

20:56 <enebo[m]> MRI will show the method name of the place the block was originally defined in

20:56 <enebo[m]> I find this inscrutable since it is actually calling a method called test_me...why would that not be in the backtrace

20:57 <enebo[m]> There will be an explosive aha moment here

21:00 <headius> so is the backtrace issue the bug?

21:00 <headius> or is that just making it hard to find the bug

21:00 <enebo[m]> it is returning wrong method name from processing backtrace/caller

21:01 <enebo[m]> if I print them out raw I can see MRI setting the test_me line as the test_something_else_from_original_file

21:01 <enebo[m]> It is like its original location is somehow overriding its name in the backtrace vs what the method is called

21:02 <enebo[m]> which if I think about it I can convince myself that lexical location of block may be more useful in a backtrace?

21:02 <enebo[m]> but it still needs both since it is also a method that is being called

21:03 <enebo[m]> The more I think about this the more I can see why this is a backtrace like this since the line it fails at is in the original source file

21:04 <enebo[m]> but the source file/line is right. The method name is the define_method one and not the lexically containing one

21:04 <enebo[m]> which sort of feels more right to me

21:05 <enebo[m]> I think one thing we have not implemented is pointing out 'block in method_name'

21:05 <enebo[m]> err I take that back

21:05 <enebo[m]> We do that but we do not do the repeated line thing

21:06 <enebo[m]> block 3 times in or whatever that is

21:25 subbu has joined #jruby

21:46 subbu has quit [Ping timeout: 255 seconds]

22:02 subbu has joined #jruby

22:15 sagax has quit [Ping timeout: 260 seconds]

22:35 <headius> got pulled away on some business

22:35 <headius> you might try forcing JIT and see if the JVM trace is more helpful

22:36 <headius> sometimes the interpreted trace gets screwy if people are messing with bindings and stuff but the JVM trace usually just uses the real file and line

22:40 <enebo[m]> I think I figured out the issue

22:40 <headius> nice

22:40 <enebo[m]> Well I definitely figured out an issue and it fixes the reported problem

22:40 <headius> what did I do

22:41 <enebo[m]> procs sent to define_method use their defining hard scope as the name

22:41 <enebo[m]> This has to be a change

22:41 <enebo[m]> you did actually write the code to change this to be the defined method name...in 2008

22:41 <headius> so they rewrite the name before posting the method

22:42 <headius> oh rewriting is actually wrong then?

22:42 <headius> it should use whatever name it had at definition

22:42 <enebo[m]> I need to see if I can observe it changing or not between versions to figure out when it differs

22:42 <enebo[m]> yeah. so test_me which is define_method name is wrong now it should be where the proc came from

22:43 <enebo[m]> This annoys me somewhat

22:43 <enebo[m]> If define_method is an analogue to def then it should report backtraces similarly

22:43 <enebo[m]> The actual file:line is to the proper lexical location where the block is already

22:44 <enebo[m]> I only have one big question on this...why the hell did this work for 9.3?

22:44 <enebo[m]> I never found a version guard

22:45 <enebo[m]> ok my change breaks a bunch of stuff but I changed two values with name

22:45 <enebo[m]> method and callee both use define_method name (which makes sense)

22:47 <enebo[m]> HAHAH

22:47 <enebo[m]> Ruby 2.6 also gives lexical location of proc
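
The behavior being described can be observed with a short script. This is a minimal sketch under assumptions: `BacktraceDemo` and `build_test_block` are hypothetical names, with `test_me` borrowed from the discussion above. It is not the mocha or minitest code that triggered the bug.

```ruby
class BacktraceDemo
  # The block is created lexically inside this method.
  def self.build_test_block
    proc do
      # Frame 0 is the block itself, so its label is what a backtrace shows.
      [__method__, caller_locations(0, 1).first.label]
    end
  end

  # test_me is the name the block is installed under.
  define_method(:test_me, &build_test_block)
end

method_name, frame_label = BacktraceDemo.new.test_me
puts "__method__:  #{method_name.inspect}"
puts "frame label: #{frame_label.inspect}"
```

Per the discussion, `__method__` reports the `define_method` name, while the frame label on MRI names the method where the block was written, so the two values printed here are expected to disagree on MRI.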

22:48 <enebo[m]> So some commit did change this but bisect did not resolve to that commit

22:48 <enebo[m]> and to be fair there was a section of commits which was building jruby.jar but blowing up while erroneously trying to install some default gems

22:49 <enebo[m]> So perhaps I was not testing against newer jruby.jar (but I was fairly sure I was fine by that point)

22:50 <enebo[m]> This won't be too hard to track down now that I realize it is a regression on how we are setting up define_method

22:56 <headius> It definitely makes more sense to use the defined name rather than the lexical surrounding name

22:56 <headius> And of course the lexical file and line number are still there so I don't see why they use the lexical name. As in your example comment, that name might be nothing if the proc came from a script top level

22:58 <enebo[m]> I have a theory

22:58 <enebo[m]> the same proc can be used for n method defs

22:59 <enebo[m]> and they are grabbing location from proc (or something like that)

22:59 <enebo[m]> so they normalized on something which would not change

22:59 <enebo[m]> The other theory is it has been this way so long they don't dare correct it now

22:59 <headius> I guess it sort of makes sense in a roundabout way but it seems far less useful

23:00 <enebo[m]> well it doesn't really make sense though

23:00 <enebo[m]> who cares what method originally created the proc

23:00 <headius> So now not only do you not know the place where it was defined, you don't even know what name it was defined under

23:00 <enebo[m]> especially since its location is there

23:00 <headius> I don't see how that's better

23:01 <enebo[m]> yeah but mocha leverages the fact that the last caller element will have the test name :)

23:01 <headius> Anyway as you said in the bug, we have to be compatible so of course we'll change it but I think it's wrong

23:01 <enebo[m]> I don't blame them since it seems like it will always work but this is a pretty weird backtrace element