#jruby on 2023-03-14 — irc logs at libera.irclog.whitequark.org

2023-03-08 19:39 ChanServ changed the topic of #jruby to: Get 9.4.2.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

03:25 sagax has joined #jruby

06:49 prashantz[m] has joined #jruby

09:41 <kares[m]> JavaClass is meant to go away eventually, some methods are still used (internally) but most are not or only from also deprecated callers

09:43 <kares[m]> been trying to supress some of the useless warnings there - not sure what the current status is will need to poke around a bit to see

09:43 <kares[m]> where's this coming from, trying to get to Java 17 and it gets a lot of warnings?

11:55 <headius> kares: take a look at the PR I merged, I managed to move most of the utility functions out to non-deprecated classes and eliminated pretty much all of the legacy JI warnings

11:56 <headius> And yeah there are way more deprecation warnings on 17 or 19 but I could really only fix a few of those since they depend on newer APIs then we can use

12:25 <kares[m]> okay - yeah I've seen the PR and started post-reviewing the changes ... so far so good

13:50 <kares[m]> well done handling the cleanup!

14:39 <headius> kares: thanks! If we jump to 17 for 9.5 we can fix nearly all of the Java deprecations

19:22 <enebo[m]> ugh

19:22 <enebo[m]> https://github.com/jruby/activerecord-jdbc-adapter/commit/e96af8f2d0fb5960a93b5a2e6f01e7f3809ac6cf

19:23 <enebo[m]> kares: I am fairly sure this is just a bad pattern match too

19:23 <enebo[m]> but this is probably good enough where I will spin a postgres release today

19:26 <enebo[m]> there is one other really obvious issue where we return timestamp and not a datetime but we fail ~100 F/E which is better than 99.8%

19:33 <headius> yay

19:46 <headius> https://github.com/jruby/jruby/pull/7673

19:46 <headius> enebo: so I was reviewing some of my PRs

19:47 <headius> what I have now is basically everything except line, coverage, and c_* events

19:48 <headius> the c_* events could possibly be added but since they have to wrap around Java calls they require a good bit of indy logic and would make the method stub objects bigger

19:48 <headius> in fact this all makes generated bytecode bigger and makes indy necessary to avoid overhead when trace is off

19:49 <headius> I could commit this still guarded behind --debug and just call it a win on lower-cost tracing but I don't know if that has much value

19:49 <headius> so I dunno

19:50 <enebo[m]> yeah

19:50 <headius> c_call and c_return can be done with indy too, always on in the DynamicMethod stubs and wrapped around the target in indy mode

19:50 <enebo[m]> I guess landing it in that form is least that can be done

19:50 <headius> maybe I should instead do another pass trying to make indy always-on

19:50 <headius> yeah

19:50 <enebo[m]> My main concern is warmup and eventual speed

19:51 <headius> I don't want to lose it but the additional bytecode for every method and block body is irreducible

19:51 <enebo[m]> without reasonable measurement of impact it is hard to think we just roll with it

19:51 <enebo[m]> but putting it in behind --debug at least gets the impl into the code base

19:52 <headius> the original goal was to get call and c_call working for the tracing enhanced test asserts so we don't have to pass --debug (or don't warn when it is omitted) and this does not get c_call

19:53 <headius> that enhanced assert just uses tracing to produce a better error message

19:53 <headius> so...

19:54 <enebo[m]> any thought on longer term idea of just passing line into callMethod ish stuff and hiding this all not in bytecode or indy?

19:54 <headius> that's essentially what the indy c_call does... wraps the target with a hook checking call site that does nothing if hooks are not on

19:54 <enebo[m]> It still translates to bytecode over all so there is a check but the actual mechanism to just keep it on the stack seems like something we could do once we put other stuff in that param list

19:55 <headius> the non-indy version requires backtrace which jitted code does not emit

19:55 <enebo[m]> err not in the generated version but somewhere

19:56 <headius> the indy version of c_call basically just embeds the file and line into the call site metadata so it can be used if hooks are ever turned on

19:56 <headius> it adds an additional safepointish thingy to every bound indy call

19:56 <headius> "should" boil away when hooks are off but it has to get optimized to do that

19:57 <headius> I dunno, I really would love to do always on hooks but without some sort of deopt it adds at least size and time... larger bytecode, more method handles, longer to optimize away

19:58 <enebo[m]> yeah

19:58 <enebo[m]> My concern is what cost is it to eliminate needing the flag

19:58 <enebo[m]> personally it is mildly annoying but things like coverage tend to be in testing where adding a flag is not a big deal

19:59 <headius> it's basically ALOAD + INVOKEDYNAMIC for every entry and exit of Ruby methods or blocks right now

19:59 <enebo[m]> sounds significant

19:59 <enebo[m]> since nearly all of Ruby is a call

20:00 <headius> I'm more concerned with the size increase for small methods

20:00 <headius> I think I'll commit it as is but still guard call/return/b_call/b_return behind the flag

20:01 <enebo[m]> so what new stuff is on?

20:02 <headius> raise and thread events will always be one now because I figured they're high cost anyway

20:02 <headius> (still guarded behind a boolean check for events being on, so it only adds an if to normal case)

20:02 <enebo[m]> ok

20:03 <headius> ok

20:03 <enebo[m]> going for a walk

20:09 <headius> oh one other small bonus: this moves all trace hook management out of Ruby.java into a new class TraceEventManager

20:23 <headius> ok it's done

20:28 <headius> ok next item I have questions about: https://github.com/jruby/jruby/pull/7600

20:28 <headius> so that apurtell who has the CVE concerns can't upgrade to 9.4.x because of a readline/reline issue in the hbase shell they are supporting

20:28 <headius> I am not sure what to do

20:28 <headius> I asked him to report those issues, but this puts pressure on us to reconsider updating Psych and SnakeYAML in 9.3.x

20:47 <headius> enebo: a question about setting call info when you get back... I see we are setting it to 0 now all the time for non-kwarg calls but I'm wondering if there's any way to eliminate that

21:09 <enebo[m]> headius: back

21:12 <headius> yo

21:13 <enebo[m]> callInfo is complicated

21:13 <headius> I was just noticing it is around all === calls for case/when too as I revisit my bytecode reductions there

21:14 <enebo[m]> ah yeah

21:14 <enebo[m]> so there are a few things to think about with callInfo atm

21:14 <enebo[m]> 1. any method can accept an empty **kw and not complain

21:15 <enebo[m]> 2. I actually went the route of if/elsing on this in IR code itself so Ruby will not send that but that needs to change

21:15 <enebo[m]> 3. I want to kill most of those keywords=true for calls which merely forward kwargs to the next method

21:16 <enebo[m]> 4. We have not really formalized how we want methods to accept kwargs

21:16 <enebo[m]> that last one is not so important for today but it is something

21:17 <headius> yeah so really the ultimate path forward is passing the arguments along but that's a long project

21:18 <enebo[m]> yeah

21:18 <headius> so in the shorter term there's reducing setCallInfo on one hand and rolling into indy call sites on the other hand

21:18 <headius> given that it's a single thread-local bit of information reducing it more gets trickier

21:18 <headius> we have to set it and clear it or state will leak

21:19 <enebo[m]> My current experiment was to shift kwarg state to second part of callInfo so forwarding could be eliminated

21:20 <enebo[m]> but that is independent of any check to make sure we do not arity error on an empty kwrest

21:20 <headius> right hmm

21:20 <enebo[m]> so one access and one shift + set

21:20 <enebo[m]> but that shift + set will pay for itself in brittleness of what we have

21:21 <headius> I ask because I was wondering if we are clearing callInfo reliably do we need to set it to 0

21:21 <enebo[m]> the if/else in IR itself does solve the empty thing largely but it very expensive compared to a bit check

21:21 <headius> but that is assuming we clear it for certain after everywhere it should be consumed

21:22 <enebo[m]> so if you don't set it and it stays the same value any next method after a native method thinks it recieved kwargs whether it did or not

21:22 <enebo[m]> If you marked a bit on it saying I entered native (an idea I had considered) it would be possible to know something but you are still messing with that int

21:23 <enebo[m]> I think MRI should have errored on empty kwargs

21:23 <enebo[m]> mri31 -e 'def foo; "HEH" end; h = {}; p foo(**h)'

21:23 <enebo[m]> This is dumb

21:24 <headius> yeah I agree with that in principal

21:24 <enebo[m]> and for us it means bifurcating a call at some point to not pass it or have all methods need to check for it

21:25 <enebo[m]> Current design in Ruby code is to bifurcate in IR itself but I think it should get put into call impl

21:25 <headius> what would you think of putting call info into CallBase as call metadata right now, so I could stick it in indy logic

21:25 <enebo[m]> The original if/else happened when I thought you could flag hashes as keyword hashes

21:25 <headius> it really is call metadata after all

21:25 <enebo[m]> now all calls know if they receive kws and at times whether they are already empty

21:26 <headius> all call instr invokes will have to set CallInfo but then I can embed it in indy and eliminate the extra bytecodes at least

21:26 <enebo[m]> yeah it just was a potential opt but as it stands it is probably just more bytecode in JIT

21:26 <enebo[m]> interp is slower but only if you explicitly are using kwargs at a site

21:26 <enebo[m]> all receivers in Ruby pay no cost

21:27 <enebo[m]> if we were RiR this would be a bit simpler too

21:28 <headius> the two things I'd like to do tonight/tomorrow is both involve making calls smarter:

21:28 <enebo[m]> removing bifurcation altogether would mean checking everywhere but it would be extremely rare and a flag bitwise op which compared to all the other machinery may not show up for much more than math microbenches

21:28 <headius> 1. call info as call metadata so I can include it with the call itself

21:28 <enebo[m]> and probably not then

21:28 <headius> 2. boolean return value from dynamic calls

21:29 <enebo[m]> For 1 callInfo is metadata already in callinstr

21:29 <enebo[m]> so you have it except for the liveness

21:29 <headius> aha ok

21:29 <enebo[m]> and having to know whether on the caller side if it has a splat or not

21:30 <enebo[m]> err callee

21:30 <enebo[m]> hahah you know what I mean

21:30 <headius> ah yes I se it in JVMVisitor.CallInstr

21:30 <headius> so I can do the rest of that myself

21:30 <enebo[m]> So my if/else was before that happened

21:31 <enebo[m]> you can rip all that out if you wantr to put the empty test into call impls

21:31 <headius> the boolean thing is interesting because every when has an additional aload + isTrue call right now and those would disappear

21:31 <enebo[m]> That is funny. I noticed that a few months back

21:31 <headius> not to mention branches and loops

21:31 <enebo[m]> It was one of those of crud

21:31 <enebo[m]> yeah I think I noticed it in branches

21:31 <enebo[m]> not neccesarily ===

21:32 <headius> indy sites will just wrap the return path with !(ret == false || ret == nil)

21:32 <enebo[m]> so how do you envision that...specialized call1ObInstr?

21:32 <enebo[m]> where b is boolean

21:32 <headius> yeah

21:33 <headius> we do have some support for boolean retval but it's in the unboxing stuff

21:33 <enebo[m]> yeah nice. then interp should not need an interpret method so long as it is just marked as calloperation

21:33 <headius> for some limited number of call forms, though, I think we can cover a large number of common isTrue checks

21:33 <headius> === will always be an arity 1 call

21:34 <enebo[m]> So you also need to add specialized Branch*

21:35 <enebo[m]> I said above you don't need to add interpret but you can and it will still work (albeit with boxing) because temp is Object

21:35 <enebo[m]> boxing probably is cheaper than isTrue

21:35 <enebo[m]> Actually I take that back the interp will not figure that out

21:36 <headius> short term it would just be a bytecode reduction

21:36 <headius> for JIT

21:36 <headius> it could be done as a JIT-only pass

21:36 <headius> so we don't introduce boolean boxing into interp

21:36 <enebo[m]> If you assume you are not always true or false it should get rid of bimorphism

21:37 <enebo[m]> unless it gets both impls back and says of one true and one false

21:37 <enebo[m]> which perhaps it does

21:37 <headius> longer term it makes it possible for us to have core methods return boolean rather than RubyBoolean and many such cases will then never even touch a RubyBoolean or do the nil/false check

21:37 <enebo[m]> less bytecode for sure

21:37 <enebo[m]> simple stack check for impl

21:37 <headius> so like str.match? or fixnum == fixnum2 can return boolean and start to fold away better

21:38 <headius> so I will do the callInfo change to indy sites first and then see what it would take to do the boolean thing

21:38 <headius> at least for case/when because that's a big offender

21:38 <enebo[m]> unrelated but I added some int instrs to support pattern matching

21:38 <headius> ah interesting

21:39 <enebo[m]> addInstr(new BIntInstr(sizeCheckEnd, BIntInstr.Op.LTE, argsNum, length));

21:39 <enebo[m]> where ArgsNum is Operand argsNum = new Integer(fixedArgsLength);

21:39 <enebo[m]> so primitive math is possible if you feel you need it internally

21:40 <headius> yeah nice

21:40 <headius> that will be useful later when we want to move more core to ruby

21:40 <enebo[m]> other things maybe not noticed is named labels

21:40 <headius> for e.g. loops we know we want to just be int loops

21:41 <enebo[m]> yeah so long as we can know the result is going to be an integer (well long I am pretty sure) we can

21:41 <enebo[m]> in fact BInt exists already too

21:42 <headius> yeah from the unboxing stuff

21:42 <enebo[m]> no from pattern matching I think

21:42 <headius> oh right

21:42 <headius> there's similar instrs for unboxing

21:42 <enebo[m]> I wonder how that stuff has held up

21:43 <headius> I do need to get more familiar with pattern matching and see what I can do in jIT

21:43 <enebo[m]> It is epic

21:43 <enebo[m]> but there are simple patterns

21:44 <enebo[m]> tbh it makes defined? look simple

21:44 <enebo[m]> but the basic patterns could be done much more simply

21:45 <headius> yeah I am keen to optimize them

21:46 <enebo[m]> I believe realizing something is homogeneous during parse could give some weight to doing more opted stuff

21:46 <enebo[m]> yeah in theory if you know all arms are 3-length array arms you could even just getaway with a single array length check and compare nothing

21:46 <enebo[m]> but there are lots of opts possible

21:47 <enebo[m]> unfortunately I don't know how common that is in real code

21:47 <enebo[m]> most uses I have seen mix lots of patterns together

21:47 <enebo[m]> even if just different arities (still that is optable as well)

22:23 <headius> two quick optimizations on a new branch: setCallInfo(0) now just does clearCallInfo

22:23 <headius> and an indy site so all callInfo in JIT are just load context ; indy

22:31 <headius> https://github.com/jruby/jruby/pull/7720