<headius> heh, found a last minute bug in backtrace limit
<headius> it was always printing "... # lines ..." because of a partial refactoring bug
<headius> well that turned out way easier than I thought: https://github.com/jruby/jruby/pull/7702
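A minimal standalone sketch of the truncation behavior being fixed here, assuming a plain list of frame strings and a user-set limit; the class name and message format are illustrative, not JRuby's actual backtrace code.

```java
import java.util.List;

// Illustrative only: show at most `limit` backtrace lines, then a summary of what was cut.
// The bug described above amounts to printing the summary even when nothing was hidden.
class BacktraceLimitSketch {
    static void print(List<String> frames, int limit) {
        int shown = Math.min(limit, frames.size());
        for (int i = 0; i < shown; i++) {
            System.out.println("\tfrom " + frames.get(i));
        }
        int hidden = frames.size() - shown;
        if (hidden > 0) {                        // only mention hidden lines when some exist
            System.out.println("\t... " + hidden + " lines ...");
        }
    }

    public static void main(String[] args) {
        print(List.of("a.rb:1:in 'a'", "b.rb:2:in 'b'", "c.rb:3:in 'c'"), 2);
    }
}
```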
<enebo[m]> headius: so getFrameName might now start returning the combined name?
<enebo[m]> I largely just skimmed to the combining bit
<enebo[m]> I half wonder if we should just put those two methods onto ThreadContext (getCalleeName/getSuperName) and then deprecate getFrameName
<headius> enebo: I did that later
<headius> all uses of getFrameName now use one or the other but I didn't deprecate it, I should do that
<enebo[m]> oh I did not see that in the PR
<headius> I force pushed a few times
<headius> you might have seen an older version
<enebo[m]> HAHA
<enebo[m]> does github just update things on a force
<enebo[m]> I swear I was just looking at a different diff like a minute ago
<enebo[m]> I did not hit reload either
<headius> when did you open it? Most of that was from last night I think
<headius> ah yeah they do have some refresh pings
<enebo[m]> lol. I have no idea, but the stuff I was talking about I thought was there, and then when I went back it was all just context
<enebo[m]> funny
<headius> interesting, I see a few places left that use getFrameName... I will fix them and deprecate I guess
<enebo[m]> yeah otherwise we will get a very strange bug report one day
<headius> some of the errors along the way were fun
<enebo[m]> because I will definitely use it not remembering
<headius> no such method "\0foo\0bar"
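The "\0foo\0bar" string above is the composite frame name leaking out undecoded. A standalone sketch of the idea, assuming the NUL-separated layout visible in that error; which part is the callee name and which is the super/original name is a guess, and none of this is JRuby's actual implementation.

```java
// Hypothetical composite-name helpers, not JRuby's real code.
// Assumed layout, taken from the error string above: "\0" + calleeName + "\0" + superName.
class CompositeName {
    static String encode(String calleeName, String superName) {
        return "\0" + calleeName + "\0" + superName;
    }

    static boolean isComposite(String frameName) {
        return !frameName.isEmpty() && frameName.charAt(0) == '\0';
    }

    // __callee__ wants the (possibly aliased) name the method was invoked under.
    static String calleeName(String frameName) {
        if (!isComposite(frameName)) return frameName;
        return frameName.substring(1, frameName.indexOf('\0', 1));
    }

    // super and __method__ want the original definition name.
    static String superName(String frameName) {
        if (!isComposite(frameName)) return frameName;
        return frameName.substring(frameName.indexOf('\0', 1) + 1);
    }

    public static void main(String[] args) {
        String composite = encode("foo", "bar");
        // Forgetting to decode is what produces errors like: no such method "\0foo\0bar"
        System.out.println(calleeName(composite) + " / " + superName(composite)); // foo / bar
    }
}
```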
<enebo[m]> yeah
<enebo[m]> for me I may use it for debugging print statements
<headius> I think I have everything green now though
<enebo[m]> I think I am going to be working on ARJDBC until probably you are back just to try and get this done
<headius> ok some of these are valid, just propagating whatever the frame name is
<enebo[m]> I realize in the past for most updates it is just syncing a few files but postgresql is not as aligned as mysql/sqlite3 and I think the changes are larger for 7
<headius> I could make a new name for the method and still deprecate, or just leave this
<headius> we doing a 9.4.2 next week?
<enebo[m]> you will be able to help or no?
<headius> yes some of the days
<headius> we are planning to spend most days at the AirBnB by the pool
<enebo[m]> headius: yeah let's set a day that works for you and I can pre-test so we can shorten the time
<enebo[m]> you can IM me or what not if you don't know yet
<headius> I don't have anything else for 9.4.2 right now
<headius> ok
<headius> I should try reverting ripper/lexer.rb and see how it runs with this branch
<enebo[m]> you can help untangle arjdbc with me then or hit some simple specs
<enebo[m]> oh yeah, that is a primary reason for the work: staying in sync with the only real world use case
<enebo[m]> Someone did specifically report the issue as well but I was unclear how real it really was since he went big with it being a primary tool
<headius> yeah hard to believe this feature dates back to 2012
<headius> I pushed reverted lexer, we'll see how it goes
<headius> you know I juggled all these GHA jobs around to put longer-running jobs up front, but it seems like they end up getting scheduled randomly anyway
<enebo[m]> it is funny you cannot order them
<enebo[m]> I have noticed the Mac queue must be a tiny number of available workers
<enebo[m]> I almost wonder if we should move those to some nightly job with notification if they fail
<headius> stdlib passed with reverted lexer
<headius> the mac queue is one worker
<headius> I'm not sure if it's simple to run more or not but this is the donated system
<headius> there's no docker isolation or anything so the jobs might step on each other
<headius> sockets etc
<enebo[m]> We could potentially run it less since I see it mark jobs as unfinished unless we only make a commit once every hour sort of thing
<headius> yeah it was stalling for some reason recently but seems ok today
<headius> now I am wondering what else we can embed into the frame name 😀
<enebo[m]> perhaps in the last week or two it has been stressed by other projects using same physical hardware or something
<headius> I think it's a dedicated machine but I can't be certain
<headius> sitting in some rack somewhere
<enebo[m]> kwarg positional descriptor
<headius> yeah that was my first thought
<enebo[m]> but with that said anything which is not used very much would be fine since it would not require decoding
<enebo[m]> or less decoding
<enebo[m]> but it should be bidirectional in not needing to be encoded either
<headius> yeah this is fun but should be a temporary fix
<enebo[m]> Can I merge your callee PR?
<enebo[m]> headius: it is very mildly possible this is a failure in AR unit tests
<enebo[m]> I did wonder if it is possible anyone else is using getFrameName in an extension
<enebo[m]> If so then it may be worth having 3 new methods and have getFrameName still just return getSuper
<headius> that might be better
<headius> I can switch the valid uses of getFrameName to getCompositeName or something
<headius> trivial refactor... not sure about that name
<enebo[m]> I think the name is ok but it is much better than using framename for intent
<enebo[m]> my only other thought is getEncodedNames but if you figure out another thing to mangle into that then perhaps even using name is problematic
<enebo[m]> but I guess we shouldn't get too bogged down on this. This last thought was just to make sure any external consumers do not suddenly get weird names
<headius> yeah hopefully we can just remove this once we don't need to do this encoding
<headius> famous last words... I'll be asking why we never got rid of it in 2033
<headius> I'll go with getCompositeName
<headius> ok that should do it
<enebo[m]> cool
<headius> looks pretty good
<headius> enebo: if you are comfortable throwing this into 9.4.2 I'm fine with it
<headius> that last change was a good idea, lowers risk of any extension getting the weird name
<headius> so it's down to internal uses, and hopefully I found all those
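A rough sketch of the compatibility shape settled on above, using hypothetical names from the discussion rather than JRuby's actual signatures: old callers of getFrameName keep getting a plain method name, and only the new accessor exposes the raw composite value.

```java
// Hypothetical sketch of the deprecation strategy, not JRuby's real ThreadContext.
class ContextSketch {
    // Raw value as carried on the frame; "\0"-separated layout assumed as in the earlier sketch.
    private String frameName = "\0foo_alias\0foo";

    /** New accessor for internal callers that know the value may be composite. */
    String getCompositeName() { return frameName; }

    String getCalleeName() { return split()[0]; }

    String getSuperName()  { return split()[1]; }

    /** Kept for extensions: never hands out the mangled form. */
    @Deprecated
    String getFrameName()  { return getSuperName(); }

    private String[] split() {
        if (frameName.isEmpty() || frameName.charAt(0) != '\0') {
            return new String[] { frameName, frameName };   // plain name: callee == super
        }
        int sep = frameName.indexOf('\0', 1);
        return new String[] { frameName.substring(1, sep), frameName.substring(sep + 1) };
    }

    public static void main(String[] args) {
        ContextSketch ctx = new ContextSketch();
        System.out.println(ctx.getFrameName());   // foo        (old callers see a plain name)
        System.out.println(ctx.getCalleeName());  // foo_alias  (new accessor sees the alias)
    }
}
```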
<enebo[m]> I think I am ok with that but it really only would have fallout if we were accessing that value by another method we are not thinking about
<enebo[m]> getFrame().getName
<headius> I audited those early in the patch but I'll have another look
<enebo[m]> before PR only 3 other uses
<headius> yeah they are all now just setting up another frame copy or binding
<enebo[m]> but I suppose if someone calls getFrame().getName() in an extension they may have the same issue of getting a weird value
<headius> propagating the composite name
<enebo[m]> with that said I am not sure this is very likely
<headius> I could extend the getCompositeName to this level of course but I dunno
<headius> and then to Binding too
<enebo[m]> yeah it would completely remove the risk but it also is embedding this semantically into frame
<enebo[m]> but with that said Frame does make sense for the two names right?
<enebo[m]> It has a super name and a callee name on it
<enebo[m]> how it is stored is a different issue
<headius> yeah there's just no way to pass along two names so it has to be composite somewhere on stack
<headius> so then there's three forms of it everywhere
<headius> I didn't decode it eagerly in frame because it may not get used and that would add allocation
<enebo[m]> yeah I just mean it may be stored on frame as composite and then stored/accessed with a weird method on Frame, but all uses would go through frame.getCallee()
<enebo[m]> moving your logic into frame I suppose
<enebo[m]> I don't know
<headius> yeah pretty much
<headius> but util methods still on TC to avoid code accessing frames directly
<enebo[m]> Is frame in same package as TC?
<headius> FWIW there's six uses of Frame.getName... five are internal to TC and one is in IRRuntimeHelpers.newFrameScopeBinding
<headius> yes
<enebo[m]> We cannot do this either but we should consider removing accessors from frame and make it package protected
<headius> yeah not a bad thought at some point
<enebo[m]> 9.5 or something...I don't even know how we can advertise visibility change
<headius> I have wondered about hiding it more or exposing it more, because escape analysis might be able to get rid of heap frames and scopes when things inline
<headius> but depending on that puts us in a precarious position
<headius> going to TC for it means it can never be eliminated
<enebo[m]> our pseudo bump pointer thing is another issue with that
<headius> right
<enebo[m]> I find it interesting to consider spaghetti again for some fields like self
<headius> just pass everything on stack
<enebo[m]> Not because it will solve that problem per se but because we will basically use the same self for like 10 calls in a row and stuff that value in 10 places
<headius> 30 base arguments to every method
<enebo[m]> well that is the ultimate until it isn't
<enebo[m]> Wasn't Scala complaining about some limit on param length
<enebo[m]> a bunch of these fields are only current value and last value
<enebo[m]> so 10 params for Frame
<enebo[m]> :)
<enebo[m]> but not all need previous value so less
<headius> yeah no problem
<enebo[m]> callInfo too
<enebo[m]> kwargs info
<enebo[m]> for positional knowledge
<enebo[m]> a few other bits and bobs but it would all be on the stack at least
<headius> yeah Truffle is able to do this for you by automatically eliding their frame objects when things inline, if they inline
<headius> so you just allocate a frame
<enebo[m]> that is nice but we don't get that so I think we have massive param list
<headius> that's odd, my last push broke one test
<headius> maybe I untagged something that wasn't quite ready
<headius> enebo: jeremyevans made an interesting discovery
<headius> the time to parse a method is quadratic on the number of parameters (or variables) in both CRuby and JRuby
<headius> I just traced it to StaticScope.findVariableName, which is a linear search and done for each new variable encountered
<headius> example script: jruby -e 's = 8000.times.map{|i| "c#{i}"}.join(","); loop { t = Time.now; eval "def foo(#{s}) end"; puts Time.now - t }'
<headius> I have a simple patch that uses a Map to track existing variables, but I don't know if we want to keep this map around forever
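A toy before/after of the lookup being discussed, with made-up names rather than the real StaticScope code: the linear scan per new variable is what makes the eval above quadratic, and the side map is the shape of the simple patch.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Toy scope: the "before" shape does a linear scan per new variable (quadratic overall),
// the "after" shape answers membership from a side map in constant time. Names are illustrative only.
class ScopeSketch {
    private final List<String> names = new ArrayList<>();
    private final Map<String, Integer> offsets = new HashMap<>();   // the extra map from the patch

    int addVariableLinear(String name) {
        for (int i = 0; i < names.size(); i++) {        // linear search, done once per variable
            if (names.get(i).equals(name)) return i;
        }
        names.add(name);
        return names.size() - 1;
    }

    int addVariableMapped(String name) {
        Integer existing = offsets.get(name);           // constant-time lookup instead
        if (existing != null) return existing;
        int slot = names.size();
        names.add(name);
        offsets.put(name, slot);
        return slot;
    }

    public static void main(String[] args) {
        ScopeSketch scope = new ScopeSketch();
        System.out.println(scope.addVariableMapped("a"));  // 0
        System.out.println(scope.addVariableMapped("b"));  // 1
        System.out.println(scope.addVariableMapped("a"));  // 0 again, no growth
    }
}
```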
<enebo[m]> hahah
<enebo[m]> Is this a real issue though?
<enebo[m]> Time.new and some big kwargs lists are still <15 and most methods are very small
<headius> yeah unlikely
<headius> probably why we never bothered to make this better than linear search
<headius> we also grow the internal names array one at a time, so we're recopying it a lot
<headius> dunno if that's worth fixing either but it's zero cost to do so
<headius> hmmm
<headius> small issue with the callee thing
<headius> it will work in define_method but only when we convert it to a real method
<headius> if it's a normal proc define_method method we freeze the name into the frame and it does not get updated with alias
<headius> ugh, because we don't pass name into blocks
<headius> probably need to be able to convert capturing define_method methods in order to fix this
<headius> blocks do not have logic to push a new frame so there's no opportunity to update it
<headius> I only discovered this in CI because we run some suites with --dev which won't do the define_method optimization
<headius> or convert it to some intermediate form that can still push a regular frame
<enebo[m]> headius: zero cost needs to consider memory but I suppose it depends on whether we flush that data at some point
<headius> I also noticed this error in test:mri:core:jit so those might not be running JIT actually 😬
<enebo[m]> There is an extra object created per scope which does go away in ParserBases
<headius> enebo: if the parser could call StaticScope.compact when it's done we can do whatever we want and throw it away
<headius> more memory but only temporarily
<headius> or parser starts tracking seen vars on its own in transient state
<enebo[m]> private Map<RubySymbol, Integer> definedVariables;
<enebo[m]> The map already exists
<enebo[m]> just not in StaticScope
<headius> heh figures
<enebo[m]> So I suppose if we used this instead of StaticScope we could then mass assign once and then not grow either
<headius> only used for warning
<headius> it seems
<headius> right
<headius> that would be win win win!
<enebo[m]> this seems to always be on
<enebo[m]> although it is inverted in how it works
<enebo[m]> since we examine staticscope first
<enebo[m]> we still have to examine parent staticscopes which would be done but then for current staticscope not use it until the scope is done
<enebo[m]> but this could get rid of jeremy's observation and eliminate all the arraycopying
<enebo[m]> which probably is a bit weird we try to pinch that memory so tight
<enebo[m]> I will open an issue on this
<headius> yeah ok, seems like there's probably a good net positive from fixing this in parser
<headius> here's hoping a bunch of stuff doesn't suddenly show up failing
<enebo[m]> I realized two things on this 1) it needs to give out a slot value which is just an int count incrementing 2) it probably also needs to maintain order found
<enebo[m]> for 2 I am not positive this matters but my nose is twitching
<headius> it would be nice if this were a Map<String, int> too
<enebo[m]> yeah that's true
<headius> int nextIndex = definedVariables.size(); definedVariables.put(name, nextIndex); return nextIndex;
<headius> eliminates all dynamic variable table stuff from StaticScope
<enebo[m]> that is not really a big problem since lvars are eagerly made
<headius> needs to be a linked hash map so the order reflects the index when we pull it out
<enebo[m]> we are eager on things which will exist before they exist but it simplifies a lot of things
<enebo[m]> yeah it can be and that does solve 2 and not require an int since we can use length
<headius> LinkedHashMap<String, int> but we might have to implement it
<enebo[m]> All the machinery of addVariable will disappear as well and just leverage (probably) how we load static scopes from IR persistence
<headius> or accept the Integer objects at parse time
<enebo[m]> we are accepting Integer today but I guess we could save on those
<headius> we are already paying integer cost today that's true
<headius> so it's still a net reduction in alloc
<enebo[m]> I think the main plus here is just a single alloc in staticscope
<headius> right
<enebo[m]> but linkedhashmap has to be cheaper than all that (although we still need a Map of some kind for other reasons)
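A sketch of the parser-side idea being circled here, under assumed class and method names: one insertion-ordered map per scope hands out slot numbers (Java generics force Integer rather than int), and the scope gets its whole name table in a single assignment when the scope closes.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical parser-side collector: insertion order of the map doubles as slot order,
// so there is no per-variable array growth and no separate linear search.
class ScopeVariableCollector {
    private final Map<String, Integer> definedVariables = new LinkedHashMap<>();

    /** Returns the slot for `name`, allocating the next slot on first sight. */
    int slotFor(String name) {
        Integer slot = definedVariables.get(name);
        if (slot == null) {
            slot = definedVariables.size();          // next free index
            definedVariables.put(name, slot);
        }
        return slot;
    }

    /** One-shot handoff when the scope closes: insertion order == slot order. */
    String[] toNameTable() {
        return definedVariables.keySet().toArray(new String[0]);
    }

    public static void main(String[] args) {
        ScopeVariableCollector vars = new ScopeVariableCollector();
        System.out.println(vars.slotFor("a"));                      // 0
        System.out.println(vars.slotFor("b"));                      // 1
        System.out.println(vars.slotFor("a"));                      // 0 again, no growth
        System.out.println(String.join(",", vars.toNameTable()));   // a,b
    }
}
```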
<enebo[m]> the lookup for nearly all methods is fine being linear
<headius> LHM might cost more than what we're populating now in definedVariables, not sure how that cost will break down
<enebo[m]> so I doubt that part of things is a visible cost unless you are jeremy :)
<headius> I don't think LHM is much more memory than HM though
<headius> just another field for walking nodes
<enebo[m]> unless what we choose creates more :)
<enebo[m]> in any case I think removing some temp object churn is a good motivation
<enebo[m]> at worst case I maintain a larger array and balance it :)
<headius> yeah
<headius> do it
<enebo[m]> haha
<headius> callee CI is back to green
<enebo[m]> actually that would be very simple
<headius> test_callee got untagged because it ran locally but --dev breaks in define_method
<headius> so we are not 100% on this feature but probably 99% level
<headius> broken = unoptimized (or unoptimizable) define_method plus __callee__
<headius> that's all I know of
<enebo[m]> one larger array which is unlikely to grow but still maintains order and has an int for size; the set stays for what it does today; one array copy to mass assign to staticscope
<headius> yeah that's another option
<headius> since we need a linear array eventually anyway
<enebo[m]> A single alloc of a temp String[50] is not a big thing
<enebo[m]> and who knows how small it could be but that would be something we would not want to ever resize normally
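And a minimal sketch of the array-based variant just described, again with invented names: the map that exists today still answers "seen before?" and hands back slots, a pre-sized array keeps the order, and the scope receives one right-sized copy at the end instead of growing element by element.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.Map;

// Hypothetical array-based collector: order and slot count live in the array,
// duplicate checks stay in the map, and the scope gets a single final arraycopy.
class ArrayScopeCollector {
    private String[] names = new String[50];   // starting size is a guess; rarely needs to grow
    private int size = 0;
    private final Map<String, Integer> slots = new HashMap<>();

    int slotFor(String name) {
        Integer slot = slots.get(name);
        if (slot != null) return slot;
        if (size == names.length) names = Arrays.copyOf(names, size * 2);   // rare, per the chat
        names[size] = name;
        slots.put(name, size);
        return size++;
    }

    String[] toNameTable() {
        return Arrays.copyOf(names, size);     // the single mass-assign copy
    }
}
```

Usage mirrors the map-based collector above; the difference is only where the order and the slot count live.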
<enebo[m]> anyways there are multiple ways
<enebo[m]> The non-linear lookup I now think could be a bigger problem
<headius> oh?
<enebo[m]> I only considered parameters
<headius> yeah it's all vars
<enebo[m]> so I think we easily see methods with 50+ lvars
<enebo[m]> most have very few or none but it is not that uncommon to see a method with 30 vars in it
<headius> yeah but we don't constantly recompile them
<enebo[m]> maybe a little uncommon
<headius> so it's a one-time cost other than runtime generation of methods
<headius> but it's a cost
<headius> maybe significant across a lot of code in a codebase
<enebo[m]> yeah maybe
<enebo[m]> I think it could be more than I originally thought but I am unclear if you could measure it in the scheme of loading and running code
<headius> well the total cost of this benchmark is the ~n^2/2 comparisons from the linear searches for new vars and the ~n^2/2 element copies as the array grows one at a time
<headius> on top of the other map we already have
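Back-of-the-envelope numbers for that claim, using the 8000-variable eval from the benchmark above (illustrative arithmetic only):

```java
public class ScopeCostEstimate {
    public static void main(String[] args) {
        long n = 8_000;                      // variables in the eval'd def from the benchmark
        long comparisons = n * (n - 1) / 2;  // the i-th add scans the i names already present
        long copies      = n * (n - 1) / 2;  // growing the names array by one recopies i elements
        System.out.printf("n=%d: ~%d name comparisons, ~%d element copies per parse%n",
                n, comparisons, copies);     // roughly 32 million of each, every loop iteration
    }
}
```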
<enebo[m]> yeah
<enebo[m]> I just don't know how big of a deal that is in practice
<headius> looks like only two new failures from removing --dev from CI jobs
<headius> one is a source line off by one and the other is some frozen error
<enebo[m]> Sorry I missed the details of that no --dev or just a bunch removed?
<headius> we were passing --dev to a bunch of suites that wanted to jit
<enebo[m]> oh hahah
<enebo[m]> yeah I can see how that happened
<headius> yeah I noticed it because this callee thing only failed with --dev locally, but it was failing in the jit jobs
<enebo[m]> That's one way to make them pass
<headius> it wasn't as an env var but clearly was propagating into MRI tests
<enebo[m]> good find
<headius> so these should get fixed
<headius> the string freezing one might be tricky since we don't guarantee the string will be the same if it runs in the interpreter and then again in bytecode
<headius> callee is super green, should we go for it?
<headius> has not run with the new --dev change though 😀
<enebo[m]> merge callee
<enebo[m]> I see a failed test in AR which is unlikely to be from that but it just happens to use callee
<headius> it is done
<headius> so I'm looking at these jit failures
<headius> one is some enumerator arity thing that's due to how jit routes calls differently... low priority but should be fixed
<headius> one is source location off by one
<headius> one is frozen non-interned string is not same string the next time because of jit
<headius> the only two that seem of possible concern are due to this:
<headius> [] jruby $ jruby -e 'def foo(&b); lambda(&b); end; def bar(&b); foo(&b); end; p bar { }.lambda?'
<headius> [] jruby $ jruby -X-C -e 'def foo(&b); lambda(&b); end; def bar(&b); foo(&b); end; p bar { }.lambda?'
<headius> true
<headius> false
<headius> I mean they should all be fixed but for 9.4.2?
<enebo[m]> they were already broken right?
<enebo[m]> we just noticed them not working today
<headius> ye
<headius> kanye
<enebo[m]> yezus
<headius> yes, just exposed now in this one suite
<enebo[m]> yeah I don't think it matters but it would be nice if they are not horribly hard
<enebo[m]> What the hell is this test
<headius> I'll see what I can do
<enebo[m]> Is that lambda test meaning that the .lambda is on the result?
<enebo[m]> if so shouldn't it print true?
<headius> so near as I can tell if the block gets reified into a proc this way it should not become a lambda when passed to lambda
<headius> simple and full do reify_closure
<headius> JIT does not, I think because we run that &b passthrough optimization
<enebo[m]> can't any proc become a lambda from being put into lambda?
<headius> apparently not
<enebo[m]> heh
<headius> false
<headius> $ jruby -e 'p lambda(&proc{}).lambda?'
<headius> once a proc always a proc
<enebo[m]> very weird
<headius> this optimization seems to be running for full and JIT but full works
<headius> so it's not actually optimizing delegation there
<headius> and full still has reify_closure
<headius> it's OptimizeDelegationPass
<headius> ahh it doesn't work in full because the local variable escapes
<headius> but why, hmm
<headius> it should be able to be a temp
<headius> oh hahah
<headius> it runs out of order?
<headius> these passes seem to be running in random order
<headius> oh boy
<headius> yeah
<headius> full doesn't use any list of passes, it has a few hardcoded in a weird order
<headius> delegation pass needs to run after opt dyn scopes otherwise the proc seems to escape
<headius> shouldn't these passes be smart enough to not run when dynscope is needed anyway?
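A generic illustration of the two complaints above, not JRuby's CompilerPass machinery: declare the pass order in one place and give each pass a guard so it simply skips scopes that still need a dynamic scope.

```java
import java.util.List;
import java.util.function.Predicate;

// Generic sketch only: pass names, the guard, and the pipeline shape are all invented.
class PassPipelineSketch {
    record Scope(boolean needsDynamicScope) {}

    record Pass(String name, Predicate<Scope> applicable) {}

    static final List<Pass> FULL_PIPELINE = List.of(
        new Pass("OptimizeDynScopes",  scope -> !scope.needsDynamicScope()),
        // ordered after the dynscope pass, and skipped when a dynscope is still required,
        // which is the situation described above where the delegated block "escapes"
        new Pass("OptimizeDelegation", scope -> !scope.needsDynamicScope())
    );

    static void run(Scope scope) {
        for (Pass pass : FULL_PIPELINE) {
            if (pass.applicable().test(scope)) {
                System.out.println("running " + pass.name());
            } else {
                System.out.println("skipping " + pass.name());
            }
        }
    }

    public static void main(String[] args) {
        run(new Scope(true));    // skips both passes
        run(new Scope(false));   // runs both, in the declared order
    }
}
```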
<headius> heh ok this is a tricky problem
<headius> I have to go now though
<headius> the last two errors are due to frozen strings getting created twice and a line number that's off
<headius> will try to file something or fix them later
<enebo[m]> nice
<headius> only the enumerator thing was easy
<headius> none of these really need to be in 9.4.2
<headius> ok I'm really done now
<headius> so I have tagged off these five failures with links to the errors, in the --dev PR
<headius> two of the failures have patches; enumerator fix is simply removing some arities and autoload fix is to use real stack trace for line number (but that breaks some tests due to canonicalization)
<headius> I've marked all the bugs and PRs for 9.4.3 but have a look at them
<headius> we can merge the --dev PR any time
<headius> ttfn