#jruby on 2023-01-11 — irc logs at libera.irclog.whitequark.org

2022-01-19 17:17 ChanServ changed the topic of #jruby to: Get 9.3.3.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

00:04 <headius> well adding naive caching to a few common super types and refined calls reduced time per iteration by 7-10%

00:04 <headius> this is just caching, not enabling inlining or any other optimizations

00:51 <lavamind[m]> headius: re armhf spec-ffi, the failures are consistent from one run to the next at least, it's not random

00:52 <lavamind[m]> Failed examples are listed here https://share.riseup.net/#a5_p6hYlSt-CG5zluUZx5g

00:55 <lavamind[m]> Oh this was already in the log I sent, I thought I had needed to trim it

05:56 <headius> ok

12:02 olleolleolle[m] has joined #jruby

12:02 * olleolleolle[m] waves all around, smiling and bobbing his head

12:03 <olleolleolle[m]> https://github.com/rubygems/gemstash/actions/runs/3889717845/jobs/6638217927 I am looking at gemstash's use of Psych, with JRuby 9.4. A NoMethodError "undefined method `parse' for #<Psych::Parser:0x388623ad> /home/runner/.rubies/jruby-9.4.0.0/lib/ruby/stdlib/psych.rb:455:in `parse_stream'" - hmmm.

14:22 <lopex[m]> https://outerproduct.net/trivial/2023-01-11_nan.html

14:33 <enebo[m]> olleolleolle: 🎉🚀

20:58 <headius> so my perf experiments yesterday seemed to help even though they were trivial

20:59 <headius> I've pushed a branch to my repo called "optz"... the only changes in there currently are caching the lookup from a few super forms and refined calls but it dropped method lookup in profiling to about half what it was

21:00 <headius> time per iteration of railsbench went from mid 1400ms to mid/low 1300ms... without any inlining or indy wiring

21:01 <headius> I'm going to pivot back to issues today but this seems promising... the top profile issues all can be greatly improved

21:01 <enebo[m]> yeah that sounds good

21:02 <enebo[m]> I am also taking a swipe at removing the bifurcation due to **empty_kwargs

21:02 <enebo[m]> for generated code it is probably not much of a difference for speed but it makes all kwargs callsites larger

21:02 <headius> yeah I saw a ton of hash overhead in the profile too so that is an area we need to improve

21:03 <headius> including uncached lookups coercing objects to hash

21:03 <enebo[m]> well hash itself is complicated without punching through ordered args instead of hashes

21:03 <headius> yeah for sure

21:03 <enebo[m]> but with that said a lot of it will always be hashes

21:03 <headius> need to make sure it is using the single-bucket hash for those

21:03 <enebo[m]> yeah that would be a good improvement

21:03 <enebo[m]> and something we can do sooner than later

21:04 <enebo[m]> I just fixed another kwarg report...yield(a: 1) was not being made a kwarg because we have that unwrap behavior

21:05 <headius> ah yeah

21:05 <enebo[m]> I had a fixme by it saying "I bet this won't work with only keywords" HAHAHA

21:05 <enebo[m]> but I cannot be expected to notice a stray comment from 2014

21:07 <headius> foo(a:1,b:2,c:3) does appear to be a single-bucket hash

21:08 <enebo[m]> smallHash?

21:08 <headius> we need to split our Hash impl into a base abstract and some different impls... we could make this tiny when we know it's a small kwargs hash

21:08 <headius> just an array of IRubyObject key, value, key, value for example

21:09 <headius> or a specialized Hash3 that has three key and three value fields
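
The "array of key, value, key, value" idea can be pictured with a minimal sketch. This is not JRuby's RubyHash or its small-hash code; `FlatPairMap` is a hypothetical name, and plain `Object` stands in for `IRubyObject`. It only shows why such a layout is a single object plus one array, with lookup as a linear scan.

```java
import java.util.Objects;

// Hypothetical sketch, not JRuby code: a tiny read-mostly map stored as one
// flat array laid out as [key0, value0, key1, value1, ...].
final class FlatPairMap {
    private final Object[] pairs;

    FlatPairMap(Object... pairs) {
        if (pairs.length % 2 != 0) {
            throw new IllegalArgumentException("expected key, value, key, value, ...");
        }
        this.pairs = pairs.clone();
    }

    // Linear scan over the keys; acceptable when there are only a few entries.
    Object get(Object key) {
        for (int i = 0; i < pairs.length; i += 2) {
            if (Objects.equals(pairs[i], key)) {
                return pairs[i + 1];
            }
        }
        return null;
    }

    int size() {
        return pairs.length / 2;
    }

    public static void main(String[] args) {
        // Stand-in for the keyword arguments of foo(a: 1, b: 2, c: 3).
        FlatPairMap kwargs = new FlatPairMap("a", 1, "b", 2, "c", 3);
        System.out.println(kwargs.get("b"));
        System.out.println(kwargs.size());
    }
}
```

No per-entry objects are allocated here, which is the contrast with the separate entry objects of a bucketed hash.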

21:09 <headius> yeah smallHash logic

21:09 <headius> it is still like five objects though

21:09 <enebo[m]> hmm in interp it is RubyHash.newHash

21:10 <headius> RubyHash, array[1], and three hash entry objects

21:10 <headius> hmm

21:10 <enebo[m]> kwargsHash is a thing in arguments compiler

21:10 <headius> I will try interp

21:10 <enebo[m]> I bet you do small hash

21:11 <headius> woah yeah

21:11 <headius> interp is not doing it

21:11 <headius> bucket array is 11 wide

21:11 <enebo[m]> heh

21:12 <enebo[m]> Though this should be a bit smarter since Hash is also a Hash literal

21:12 <headius> small hash literals should also use this

21:12 <enebo[m]> and one which can grow. So probably just same logic as compiler where it is only for kwargs

21:12 <enebo[m]> I am not so sure

21:13 <enebo[m]> unless that hash swaps to more buckets it could be bad to pass that literal to a method which adds 2000 more keys

21:13 <headius> it would immediately rehash if modified in most cases

21:13 <headius> and spread out the buckets

21:13 <enebo[m]> ok then why not always use smallHash

21:14 <enebo[m]> I suppose this change would do that

21:14 <headius> well if it starts with 100 entries it's a linear search to look anything up

21:14 <enebo[m]> we do have pairs.length so this is not worth discussing :)

21:14 <enebo[m]> but it may make sense to have some smarter method in RubyHash which picks for us

21:15 <enebo[m]> that new method could also use any eventual shaping

21:15 <enebo[m]> newHash(int initialEntries)

21:15 <headius> https://github.com/jruby/jruby/blob/129bd6312839fed1e7054af22607bbcd09b8ec21/core/src/main/java/org/jruby/ir/runtime/IRRuntimeHelpers.java#L1562-L1578

21:16 <headius> looks like I use 5 as the limit for small hash

21:16 <headius> oh no 10

21:16 <enebo[m]> heh this method does exist then

21:16 <headius> yeah length of incoming array / 2 > 10 it goes to normal hash
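
The size check described here can be sketched as follows. This is an illustration only, not the code in IRRuntimeHelpers: `HashPicker`, `Kind`, and `pick` are hypothetical names, and the only detail taken from the chat is the rule that more than 10 entries (incoming array length / 2) means a normal hash.

```java
// Hypothetical sketch of choosing a hash layout from the incoming pair count.
final class HashPicker {
    // Limit quoted in the chat, where it is described as probably arbitrary.
    static final int SMALL_LIMIT = 10;

    enum Kind { SMALL, NORMAL }

    // pairs is laid out as [key, value, key, value, ...], so entries = length / 2.
    static Kind pick(Object[] pairs) {
        return pairs.length / 2 > SMALL_LIMIT ? Kind.NORMAL : Kind.SMALL;
    }

    public static void main(String[] args) {
        System.out.println(pick(new Object[6]));  // 3 entries
        System.out.println(pick(new Object[22])); // 11 entries
    }
}
```

Keeping the choice in one factory-style method means callers such as a literal builder or a kwargs builder would not each need their own threshold.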

21:17 <enebo[m]> sequential check of 10 is likely not a bad value

21:17 <headius> yeah it's probably an arbitrary number

21:17 <enebo[m]> ultimately the allocation of short-lived hashes probably overshadows actually using them too

21:18 <headius> for kwargs most will immediately get deconstructed into local vars too

21:18 <enebo[m]> so 10 element hash only used once for a short time vs making mega bucket system to just get thrown out

21:18 <headius> yeah

21:18 <headius> so I will hook up IR Hash to this

21:18 <enebo[m]> cool

21:19 <enebo[m]> Object array [a, 1, b, "foo", c, :bar] where lookup is sorted :)

21:20 <headius> yeah that is what I envision when we plumb kwargs through

21:20 <enebo[m]> ln(n) equals would probably scale a lot higher than 10

21:20 <enebo[m]> but largely you only want this for read-only

21:20 <headius> yeah

21:21 <enebo[m]> which for static known kwargs this is just how we pass stuff

21:21 <enebo[m]> but you just said that

21:23 <headius> and indy will ideally not alloc anything at all when target method receives kwargs and has no restkwargs

21:24 <headius> kwarg arguments passed on stack as key, value, ... and put in the right places on the receiving side
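
A rough sketch of "passed as key, value, ... and put in the right places on the receiving side", under loud assumptions: `KwargBinder` and `bind` are hypothetical, nothing here reflects JRuby's actual indy wiring, and the `slots` array merely stands in for the receiver's local variables. The sketch itself still allocates arrays, so it shows only the matching step, not the allocation-free goal.

```java
import java.util.Arrays;

// Hypothetical sketch: match flat [key, value, ...] arguments to the
// keyword parameters a receiving method declares, without building a map.
final class KwargBinder {
    static Object[] bind(String[] declared, Object... flat) {
        Object[] slots = new Object[declared.length];
        for (int i = 0; i < flat.length; i += 2) {
            int slot = indexOf(declared, flat[i]);
            if (slot < 0) {
                throw new IllegalArgumentException("unknown keyword: " + flat[i]);
            }
            slots[slot] = flat[i + 1];
        }
        return slots;
    }

    private static int indexOf(String[] declared, Object key) {
        for (int j = 0; j < declared.length; j++) {
            if (declared[j].equals(key)) {
                return j;
            }
        }
        return -1;
    }

    public static void main(String[] args) {
        // Receiver declares (a:, b:, c:); caller passes c: 3, a: 1.
        Object[] slots = bind(new String[] {"a", "b", "c"}, "c", 3, "a", 1);
        System.out.println(Arrays.toString(slots));
    }
}
```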

21:24 <enebo[m]> yeah nice

21:27 <headius> yeah measurable improvement on a kwargs bench with this

21:27 <headius> noisy from alloc but it goes from 3.3-3.5 down to 3.0

21:28 <headius> alloc is always the problem

21:28 <enebo[m]> what does 3 mean?

21:28 <headius> oh seconds for this kwargs bench

21:29 <enebo[m]> ok. for some reason I thought you meant railsbench :)

21:29 * headius sent a code block: https://libera.ems.host/_matrix/media/v3/download/libera.chat/b714a5f35d04bf812c58ffd6810ad079d1cf2974

21:29 <headius> we probably won't see it affect railsbench much because most kwarg calls should get jitted

21:29 <enebo[m]> yeah I would assume so

21:30 <enebo[m]> I do not expect to see any change unless something does not JIT

21:30 <enebo[m]> Even then I would not expect it

21:33 <headius> I pushed it to master

21:33 <headius> YOLO

21:33 <enebo[m]> nice!

21:38 <headius> this snakeyaml thing is a pit of despair

21:38 <headius> Andrey does not believe these are true exploits and that existing mitigation is enough

21:39 <headius> https://bitbucket.org/snakeyaml/snakeyaml/issues/561/cve-2022-1471-vulnerability-in

21:39 <headius> check out this epic yarn

21:45 <enebo[m]> HAHAH omg I feel for this guy

21:45 <headius> yeah

21:45 <enebo[m]> The feature is you can load anything in your classloader PERIOD. But I may accept random untrusted data which does something bad. Ok?

21:45 <enebo[m]> They literally do not understand there is no solution to that

21:46 <enebo[m]> unless they want a new feature with a whitelist even then it could still be a DOS

21:46 <headius> I think I can dodge this issue by using this SafeConstructor mentioned along the way

21:46 <enebo[m]> but even with CL you could restrict types by just putting in a CL which can only see some classes
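
The "CL which can only see some classes" idea can be sketched with the plain JDK. `AllowlistClassLoader` is a hypothetical class, not part of SnakeYAML or JRuby, and it only filters names requested directly through it; as noted in the chat, an allowlist does not rule out denial-of-service input.

```java
import java.util.Set;

// Hypothetical sketch: a class loader that refuses any class name that is
// not on an explicit allowlist, and delegates to its parent otherwise.
final class AllowlistClassLoader extends ClassLoader {
    private final Set<String> allowed;

    AllowlistClassLoader(ClassLoader parent, Set<String> allowed) {
        super(parent);
        this.allowed = Set.copyOf(allowed);
    }

    @Override
    protected Class<?> loadClass(String name, boolean resolve) throws ClassNotFoundException {
        if (!allowed.contains(name)) {
            throw new ClassNotFoundException("not on allowlist: " + name);
        }
        return super.loadClass(name, resolve);
    }

    public static void main(String[] args) throws Exception {
        ClassLoader cl = new AllowlistClassLoader(
                ClassLoader.getSystemClassLoader(), Set.of("java.util.ArrayList"));
        System.out.println(Class.forName("java.util.ArrayList", false, cl).getName());
        try {
            Class.forName("java.lang.ProcessBuilder", false, cl);
        } catch (ClassNotFoundException e) {
            System.out.println("blocked: java.lang.ProcessBuilder");
        }
    }
}
```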

21:46 <headius> basically fix Psych to configure SnakeYAML to not do any Java object support

21:46 <headius> which is what he mentions as current mitigation

21:47 <enebo[m]> Yeah we are not loading Java so seems safe

21:48 <enebo[m]> oh heh... I said that before I made it to whitelist discussion

21:51 <enebo[m]> this is epic

21:51 <enebo[m]> Let's face it YAML supporting object extensions was always a huge mistake

21:52 <headius> I'm not sure we even use this logic

21:52 <enebo[m]> We must support !ruby or whatever that syntax is

21:52 <enebo[m]> but I imagine that is us making the instances directly

21:53 <enebo[m]> which would likely be the same CVE in a different form as this snakeYAML issue since untrusted data could conceivably create any Ruby instance

21:53 <headius> that is done at the Psych level though

21:54 <enebo[m]> yeah I am not saying that is our snakeyaml problem but just that it is a generic problem of YAML allowing this in the first place

21:54 <enebo[m]> Psych will end up with this report

21:54 <headius> and I think they've mostly mitigated that in recent versions (unsupported by default)

21:54 <headius> yeah for sure

21:54 <enebo[m]> yeah turning it off is the only way to make it safe

21:54 <headius> it is a problem for any markup that can be used to arbitrarily create objects

21:55 <enebo[m]> yeah it is the basic issue. but it is the feature. So I still cannot see how snakeYAML can do anything about it

21:55 <enebo[m]> but I am not done reading either :)

21:58 <enebo[m]> "This call chain is possible, not because of a misbehaving constructor, but because SnakeYaml calls hashCode on an untrusted object, leading to remote code execution."

21:58 <enebo[m]> HAHAHAH wow

21:58 <headius> an untrusted object whose code you have already loaded into the VM

21:58 <enebo[m]> "I like the solution where whatever the problem code is, is just disabled by default. Just my two cents."

21:58 <enebo[m]> I like this guy

22:00 <headius> he has multiple paragraphs and links on the main project page about these CVEs

22:02 <headius> yeah I don't think we are affected

22:02 <headius> we don't even use the yaml serialization system from SnakeYAML, we use the parser directly and just pass its nodes to Psych Ruby code

22:02 <enebo[m]> ' 00% of the application which use SnakeYAML do not parse data from untrusted sources.

22:02 <enebo[m]> I’m honestly baffled by this claim. This CVE and the so called “low quality tooling” pointed us to a huge security issue in our application. We use SnakeYaml to parse somewhat complex configuration files and while those can only be accessed by admins of the application, that does not mean that they are trusted to execute arbitrary code on a production system.'

22:03 <enebo[m]> I don't understand this. They have admins who write config files and are worried they will attack the production system?

22:04 <enebo[m]> Perhaps those "admins" are just customers? If people are worried employees can inject and attack their own systems then I think they maybe have another issue

22:07 <enebo[m]> so to reiterate. I do think YAML should not support this and for this guy's sanity he should just toggle it off by default with a simple way for people to enable it (which most people do not need)

22:08 <enebo[m]> The YAML police will not come after him but these people will never stop reporting this and it is clear the guy is seriously burnt out

22:08 <headius> yeah this is all based on using his high-level entry point, the Yaml class, to dump and load YAML from/to Java objects

22:08 <headius> we don't do that

22:09 <headius> oh hey I had an epiphany about the yarp parser... by the time it's usable we may have Panama for real in an LTS JDK... we can use that to generate a wrapper on the fly without shipping additional JNI code

22:10 <enebo[m]> yeah I think so too re YARP but I am currently suggesting the JNI bindings are part of YARP

22:10 <enebo[m]> We can connect to the .so however we want though

22:10 <headius> 21 out this fall will be the next LTS

22:10 <headius> yeah

22:10 <enebo[m]> I almost did it with FFI for my POC

22:11 <enebo[m]> but benoit already had made the jni so I just used that

22:11 <enebo[m]> It literally binds one void method with no args and one method byte[], byte[]

22:33 <enebo[m]> https://github.com/jruby/jruby/pull/7574

22:33 <enebo[m]> Not too important but I am happy to see this get a little smaller in size

23:10 <headius> enebo: ah yeah nice

23:10 <headius> I realized I could also embed these into the indy call path and eliminate the setCallInfo call from bytecode

23:11 <enebo[m]> The basic handling logic is still in 3 places so I may get that smaller but there are subtle differences

23:11 <headius> if it were an attribute of the call instr that would be easier to do

23:11 <enebo[m]> yeah in MRI these are embedded into the call site data itself

23:11 <headius> right

23:11 <enebo[m]> this is much more of a make it fit

23:11 <headius> makes sense and we'll need to do that for kwarg plumbing later anyway

23:12 <enebo[m]> and my current dirty tree is eliminate the keywords = brittleness in native methods

23:12 <headius> all calls will need a kwarg descriptor at some point, but that can just be the call info right now

23:12 <enebo[m]> I am going to not erase callinfo in native methods but mark callinfo it was in a native method

23:12 <enebo[m]> if it goes to a normal ruby method that method will see what it passed still so all passthroughs will just work

23:13 <headius> ah yeah

23:13 <enebo[m]> native -> native calls will know it should erase callinfo unless the second native call is keywords

23:13 <enebo[m]> the only problem here is I reversed the brittleness

23:13 <enebo[m]> any calls which do not take keywords will still think they are passed

23:14 <enebo[m]> (in native methods which happen to callMethod)

23:14 <enebo[m]> In my mind, at least, I believe no method we callMethod will actually be processing kwargs unless they expect them

23:14 <enebo[m]> If they expect their own we have to set that up in Java and we will mark callinfo for that to work

23:15 <enebo[m]> but I do not know of any direct java to ruby calls we make via callMethod which would expect or get data from the original method and expect it to be not a kwarg

23:16 <enebo[m]> That perhaps sounds a little confusing but the conclusion is I think the number of places where we see brittleness in reversing this will be ~0

23:17 <headius> once we have a kwarg call path, this will just go away

23:17 <enebo[m]> well yeah callInfo should go away. It is too clever

23:17 <headius> it will be implied by calling along the kwarg call path anyway

23:17 <enebo[m]> I am not giving myself credit there but I am just saying making something which is per callsite and making it a single value per thread is pretty complicated

23:17 <headius> yeah

23:17 <headius> well, moving the current callInfo into the call site wouldn't be too hard

23:18 <headius> just make it an attribute of all call instructions and I can embed it into the call site (or just emit what we have now for non-indy call path)

23:18 <enebo[m]> the main problem is where we have callInfo from a call (think interp) and where we have access to either indy or a CallSite object itself

23:18 <enebo[m]> for JIT this is not hard because you have it all

23:19 <enebo[m]> In the actual call itself it needs to be able to ask for it.

23:21 <enebo[m]> MRI solves some of the complexity by having new call_keywords sorts of methods

23:21 <headius> right

23:21 <enebo[m]> so ec I think gets more easily accessible (it has been a long time since I looked)

23:22 <enebo[m]> but I am happy to see this get replaced and you may remember I also want native calls to be able to receive their callsite

23:22 <enebo[m]> or to be able to access it

23:22 <enebo[m]> with generic cache storage

23:22 <enebo[m]> then sprintf and strftime and regexp (etc) can just save things off without needing special stuff

23:23 <enebo[m]> Even if this was @JRubyMethod(needsSiteCache=true) I think it would be cool

23:23 <headius> byteit101: https://github.com/jruby/jruby/issues/7565

23:24 <headius> subspawn fixes this so an argument could be made for backporting it into 9.3, but I think my comment is right about just using subspawn on 9.3 to replace spawn

23:25 <headius> I guess I could fix our legacy spawn though too

23:26 <byteit101[m]> But subspawn shouldn't be used unless users are requiring it themselves

23:27 <byteit101[m]> also, require-builtin isn't the public API, it is for ruby impls to call. I commented on the ticket

23:27 <headius> if someone wanted to use subspawn to replace spawn, though, that's the simplest way, no?

23:28 <byteit101[m]> subspawn/replace not subspawn/replace-builtin

23:28 <headius> ok

23:29 <byteit101[m]> replace-builtin doesn't pull in PTY stuff

23:37 <headius> enebo: hah, you fixed this

23:38 <headius> https://github.com/jruby/jruby/commit/e38db716a2c393b2f1fefec3ea315da64206e0d6

23:40 <headius> enebo: I think we should just do 9.3.10 as soon as possible rather than 9.3.9.1

23:40 <headius> the .1 was for more snakeyaml garbage mostly wasn't it?