<headius> good morning!
apaokin[m] has quit [Ping timeout: 246 seconds]
BZK[m] has quit [Ping timeout: 256 seconds]
rcrews[m] has quit [Ping timeout: 252 seconds]
jimtng[m] has quit [Ping timeout: 252 seconds]
olleolleolle[m] has quit [Ping timeout: 252 seconds]
enebo[m] has quit [Ping timeout: 252 seconds]
kares[m] has quit [Ping timeout: 260 seconds]
byteit101[m] has quit [Ping timeout: 264 seconds]
nilsding has quit [Ping timeout: 246 seconds]
headius has quit [Ping timeout: 256 seconds]
PavanNambi[m] has quit [Ping timeout: 260 seconds]
apaokin[m] has joined #jruby
rcrews[m] has joined #jruby
jimtng[m] has joined #jruby
enebo[m] has joined #jruby
<enebo[m]> it must be false!
<enebo[m]> I don't get it
olleolleolle[m] has joined #jruby
<enebo[m]> oh the new vs old csv
<enebo[m]> yeah
<enebo[m]> In ruby profile I did see it was in a looping method all in Ruby but most of it was not self time
<enebo[m]> I could not get that to happen and I did threshold 0
byteit101[m] has joined #jruby
<enebo[m]> I guess I will need to try this again
<enebo[m]> I also think we should bench strscan separately as well
<enebo[m]> ah that could be the difference I guess
<enebo[m]> I was just running with default JIT settings then set to 0
kares[m] has joined #jruby
<enebo[m]> old csv with indy also I take it
<enebo[m]> I believe the main difference which will shake out is old csv tried as hard as possible to remove Ruby execution
<enebo[m]> native is basically best case JIT
<enebo[m]> From what I recall I was further apart but I may have been using my split_opts branch
<enebo[m]> yeah I did not look for loops but I still think that is a good idea
<enebo[m]> It is not in a ton of code, but without being able to inline, if there is a while there is a good chance it will never jit
headius has joined #jruby
<enebo[m]> In this case I think it could be much faster but in many cases that while is near the top and only called once
<enebo[m]> (although as a server it would get there I suppose)
<enebo[m]> immediately compiling while would probably stick out on single run things
<enebo[m]> while
<enebo[m]> I would like to see if that ends up being like 5 methods or 500
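A minimal sketch of the policy being floated here, assuming a simple call-count JIT threshold; the type and method names below are illustrative only, not JRuby's real JIT internals.

    // Hypothetical sketch; CompilableMethod, hasExplicitLoop(), and the
    // threshold handling are made-up names, not JRuby's JIT machinery.
    interface CompilableMethod {
        // Would be set by the IR builder when it sees a while/until construct.
        boolean hasExplicitLoop();
    }

    final class LoopAwareJitPolicy {
        private final int threshold;

        LoopAwareJitPolicy(int threshold) {
            this.threshold = threshold;
        }

        boolean shouldCompile(CompilableMethod method, int callCount) {
            // A method that spins inside an explicit loop may only ever be
            // entered once, so it never reaches the call-count threshold;
            // compile it immediately instead of waiting.
            if (method.hasExplicitLoop()) {
                return true;
            }
            return callCount >= threshold;
        }
    }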
<headius> after 50 iters the top four items in profile are transcoding and regex
<enebo[m]> but I suspect loops are pretty uncommon
<headius> actually top seven
<headius> probably 70% of profile
<enebo[m]> now do that profiling for old
<enebo[m]> That bench + data was from an old reported issue and what makes it more interesting (to me) is that it does have some mbc utf-8 data in it
<enebo[m]> If we picked a pure ASCII file I wonder if we would see a big shift
<headius> interestingly 3.77% is getting thread-local joni stack when it is empty
<headius> so it is not caching right or just oddly heavy
<enebo[m]> not caching would be an amazing find
<headius> I'll do a profile of old_csv too
nilsding has joined #jruby
<headius> heavier regex use in the new one could explain it if we're allocating a lot extra
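A generic sketch of the per-thread caching pattern being questioned here (not joni's actual code; StackPool and acquire are made-up names): the point is to hand back a reused per-thread buffer rather than allocate a matcher stack on every regex call.

    // Generic per-thread buffer cache sketch, not joni's real implementation.
    final class StackPool {
        private static final ThreadLocal<int[]> CACHED_STACK =
                ThreadLocal.withInitial(() -> new int[64]);

        // Return the thread's cached buffer, growing (and re-caching) it only
        // when the requested capacity exceeds what is already stored.
        static int[] acquire(int minCapacity) {
            int[] stack = CACHED_STACK.get();
            if (stack.length < minCapacity) {
                stack = new int[minCapacity];
                CACHED_STACK.set(stack);
            }
            return stack;
        }
    }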
<enebo[m]> heh...I probably was using the split_opts branch since it does a lot of split(",")
<enebo[m]> but not all strings are clean 7 bit
<enebo[m]> I should land it removing my more aggressive attempt and converting more regexps to strings but I really want that feature in
<enebo[m]> right now it is passing for regexps of length 1
<headius> you could land the conservative stuff for 9.4.2 at least
<enebo[m]> yeah I just don't want to lose track of this
<enebo[m]> but I guess I sort of already did
<enebo[m]> once we can have some callsite data it can become even faster
<enebo[m]> since it can then switch to making an executor and not perform selection logic
<enebo[m]> (split can still realize it needs non-ascii after picking ascii so there is still something)
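Roughly what a callsite-cached split strategy could look like, as a sketch only; SplitCallSite and the Strategy names are hypothetical, and the real selection logic would also have to consider the separator and encoding.

    // Hypothetical callsite cache for split strategy selection; names are
    // illustrative only, not JRuby's actual split implementation.
    final class SplitCallSite {
        private enum Strategy { UNSET, SINGLE_BYTE_ASCII, GENERIC }

        private Strategy cached = Strategy.UNSET;

        Strategy strategyFor(boolean ascii7bit) {
            if (cached == Strategy.UNSET) {
                // Selection logic runs once per callsite rather than on every call.
                cached = ascii7bit ? Strategy.SINGLE_BYTE_ASCII : Strategy.GENERIC;
            } else if (cached == Strategy.SINGLE_BYTE_ASCII && !ascii7bit) {
                // The fast-path assumption broke (non-ASCII data showed up),
                // so fall back to the generic splitter from here on.
                cached = Strategy.GENERIC;
            }
            return cached;
        }
    }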
<headius> zero regex use on old_csv
<enebo[m]> heh
<enebo[m]> THE ANSWER HAS BEEN FOUND
<enebo[m]> hopefully at least it is not using match data on new one
<enebo[m]> but I think it is largely internal to strscan right?
<headius> I can check the back trace on those regex calls
<headius> Got to pick a sick kid up from school, brb
<enebo[m]> Do you remember which version of maven started swapping the order in .xml file generation?
<headius> It would be good to see your branch after the jet has run
<enebo[m]> after burners
<headius> I do not remember where that happened but you mentioned upgrading in here so maybe you can search for it
<enebo[m]> yeah ok
BZK[m] has joined #jruby
<headius> I just know the default maven on Fedora had the problem when I installed Fedora 36 or 37
<enebo[m]> I just told them to get a newer maven
<enebo[m]> and any new maven will have the same ordering as us
<enebo[m]> (unless they changed something :)
<headius> Yeah I hope not
<headius> 100% of regex use in new csv seems to be strscan
PavanNambi[m] has joined #jruby
<enebo[m]> yeah. strscan is used in a lot of stuff for "performance"
<enebo[m]> so we should definitely bench how well we work and potentially tune as much as we can
<enebo[m]> split_opts seemingly not making a lot of difference here but as I said the mixed char data will basically just be slow path
<enebo[m]> On my machine (using java 8) the difference between old and new with threshold 0 and indy is 1.8s vs 1.6s, so about 12.5% slower
<enebo[m]> Thinking adding a single byte split for utf-8 would help but I am not sure how many versions of things to add :)
<headius> maybe just not converting enough splits to make a difference
<enebo[m]> I could print out histogram of selections. It is possible the ascii one is never called at all
<enebo[m]> I definitely think for new performance putting an instant JIT for loops would fix a bulk of the gap
<enebo[m]> I am going to figure out how many loops happen running various commands
<enebo[m]> I think explicit loops are uncommon enough where jitting them immediately would not affect warmup
<enebo[m]> segments.pop while segments.any? {|s| String === s }
<enebo[m]> segments.push 0 while segments.size < 2
<enebo[m]> segments.pop while segments.size > 2
<enebo[m]> lol so some weird stuff I did not expect to see but with that said this is all in the same method
<enebo[m]> 11 while loops in csv (new one)
<headius> is this mostly in one method?
<headius> I did not track down what looping method is getting jitted here
<enebo[m]> This appears to mostly be different methods in csv
<enebo[m]> I need to make sure these are being called but I guess they must since we lazily build methods
<enebo[m]> I just put a counter and print into buildConditionalLoop
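The instrumentation being described is presumably nothing more than a throwaway counter along these lines (a sketch, not the actual change to buildConditionalLoop):

    // Throwaway counting instrumentation of the sort described above; this is
    // a sketch, not the real patch to the IR builder.
    import java.util.concurrent.atomic.AtomicInteger;

    final class LoopBuildCounter {
        private static final AtomicInteger LOOPS_BUILT = new AtomicInteger();

        // Called from the loop-building path (e.g. buildConditionalLoop);
        // the running total shows how many while/until constructs were built.
        static void record(String sourceLocation) {
            System.err.println("loop #" + LOOPS_BUILT.incrementAndGet() + " at " + sourceLocation);
        }
    }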
<enebo[m]> 47 loops in gem list
<enebo[m]> I find this really odd though...openssl/buffering.rb has 6 methods with whiles in them show up
<headius> just starting up?
<enebo[m]> yeah
<enebo[m]> I am truly confused though...it is more than actually exist in the file
<enebo[m]> says 6 times but there are only 3 methods. Somehow we enter here twice
<enebo[m]> oh I see an until ok
<enebo[m]> so 3 untils and 3 whiles
<enebo[m]> and seemingly they all get used
<enebo[m]> I should perhaps do this as a percentage of what would get compiled normally
<enebo[m]> 139 methods+blocks jit on gem list
<enebo[m]> but the 47 would be more than 47 since it likely has blocks in those methods
<enebo[m]> gem list is not really an important use case per se since I think users will --dev
<headius> I fixed one more and it looks like everything older than Nov 23 should be punted
<headius> mostly aspirational things getting kicked down the road
<enebo[m]> ok
<enebo[m]> oh you have something which was supposed to land for .11 sitting there
<enebo[m]> I think we did not land it for bake time
<headius> I see two PRs: update Psych and update joni + strscan
<headius> the latter is probably safe enough even if it is adding some features to strscan
<headius> psych I dunno but that was important for the CVE nonsense
<enebo[m]> 197 loops on that boom-app rspec bug (so rails loading coverage+rspec)
<headius> it would force 9.3 to yaml 1.2 and safe_load logic
<enebo[m]> joni+strscan was the PR
<enebo[m]> I think we decided to not stuff it in immediately before .10 release
<headius> it might be possible to switch older psych to newer snakeyaml library for CVE but that still forces yaml 1.2
<enebo[m]> not that we were only considering doing it
<headius> and I'd need to get a branch and update release for old psych
<enebo[m]> I honestly don't know enough about YAML to know if that is risky or not
<headius> ok I can review the strscan thing... it is already done on master in 9.4.1
<headius> it doesn't seem high risk but there are a few gotchas in aliasing
<headius> (yaml 1.2)
<enebo[m]> 455 methods /blocks JITd Rails app
<enebo[m]> 197 more things might be a lot
<enebo[m]> possibly interesting observation...a number of methods have nested loops
<headius> joni+strscan for 9.3 can land any time
<enebo[m]> in bootstrapping a number of these revolve around loops doing things with paths
<headius> ah sure
<enebo[m]> I think the main issue with the idea is it needs to be synchronous compiles
<headius> right
<enebo[m]> which could impact startup a lot more than normal JIT
<headius> I have hardly profiled JIT overhead at all, I'm sure it is heavier than it needs to be
<headius> other than irreducible costs like loading the bytecode into JVM
<enebo[m]> I have no doubt stuff like LVA is pretty expensive
<headius> ah so the other items on my list were in jffi
<headius> I will see about getting a centos VM to test glibc thing
<enebo[m]> cool
<headius> should I merge joni + strscan update into 9.3 branch?
<enebo[m]> I was thinking if I added dtoa to jnr-posix I could fail on non-native and probably land the printf stuff sooner than later
<headius> this also is a much better strscan that passes all tests so I think even with the feature additions it's a good move
<enebo[m]> eventually a pure-Java one could get added
<headius> yeah you should go for it on dtoa
<enebo[m]> oh I remember the issue with strscan
<enebo[m]> not that I think we care
<enebo[m]> it supports regexps in scan and that is not supported in 2.6
<enebo[m]> It will not break existing 2.6 uses but it will not complain
<headius> right
<enebo[m]> I think that is ok
<headius> I agree
<enebo[m]> it may lead to spec fails :P
<headius> PR is green so it seems ok
<headius> (with updated MRI strscan tests)
<enebo[m]> but 2.6 mri strscan tests?
<headius> no
<headius> those fail
<enebo[m]> just edge-case stuff like error cases?
<headius> I don't recall actually
<enebo[m]> while true
<enebo[m]> csv definitely was gone over with a fine-tooth comb
<enebo[m]> for MRI at least
<headius> other environments are starting to pick up the bad maven
<enebo[m]> -> { @s.scan("aoeu") }.should raise_error(TypeError)
<headius> looks like they are rolling an update out incrementally because most jobs work
<enebo[m]> ah so I remembered this backward
<enebo[m]> it was only regexp in 2.6 and string was accepted later
<headius> ahh right
<enebo[m]> I expected to see a failure for this but I am surprised it is a single spec
<headius> yeah low risk there
<enebo[m]> this csv parser is some pretty big and sophisticated looping methods
<enebo[m]> a lot of code and a bunch of nexts
<headius> fun
<headius> lots of cyclomatic complexity
<enebo[m]> I am not putting it down but I imagine these methods are pretty large in IR
<enebo[m]> The other random thought reading this is whether we are setting CR on substrings
<enebo[m]> You mentioned transcoding but us marking stuff as VALID vs 7BIT would have a huge difference
<headius> could audit cr propagation
<enebo[m]> we heavily rely on generic encoding methods in a lot of our string methods but we can definitely improve things; looking more greedily for 7bit up front can end up much faster
<enebo[m]> Split on ascii string with "," ended up being like 80% quicker
<enebo[m]> which might mean something is not optimizing well in the string helpers we use but I think it is largely because it is just some dead simple loop
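For reference, the 7-bit check that makes the ASCII path so cheap is essentially a single byte scan like this sketch (names are illustrative; JRuby's real coderange constants and helpers live in its string support code):

    // Minimal sketch of an up-front 7-bit scan: if every byte is < 0x80 the
    // string can be marked 7BIT and split/index can work on raw bytes with
    // no multibyte decoding at all.
    final class CodeRangeScan {
        static boolean isSevenBit(byte[] bytes, int start, int len) {
            for (int i = start; i < start + len; i++) {
                if ((bytes[i] & 0x80) != 0) {
                    return false;   // high bit set: multibyte or broken data
                }
            }
            return true;
        }
    }

Once a string is known to be 7-bit, splitting on "," reduces to searching for one byte, which is why the ASCII split path comes out so much faster.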
<headius> there's probably many rounds of CR optimizations missing from cruby
<enebo[m]> yeah I would not be surprised if they added stuff and we never noticed. I do not really follow their development very closely.
<enebo[m]> They just tend to end up a reference when I want to understand behavior
<enebo[m]> 100% of all lines delivered to get_lines is CR 0
<headius> hah
<headius> well that's not helpful
<enebo[m]> yeah I guess I need to figure out where it came from and whether it reasonably should have one
<enebo[m]> we make so many micromistakes around sharing code
<enebo[m]> some callers call through a method which looks up a separator but then calls another method which has to have same logic because other things call it directly instead
<enebo[m]> in case of each line stuff it means checking opts twice and checking separator arg twice
<headius> yeah bad refactoring probably
<enebo[m]> really small overhead obviously
<enebo[m]> the main split method is 130ish lines with several call outs to other ways (enumerate) in a few places
<enebo[m]> I definitely feel we get here because we tend to port some methods closely
<enebo[m]> interestingly this method does not even care about CR or even if the string is a valid string
<enebo[m]> It just seems to take it at its word but after the recent mail gem issue with joni I wonder how much code is literally forgiving on broken data
<headius> hmm I booched something in chr fix
<headius> centos installing on my other machine
<enebo[m]> lunch
<headius> oops, >> vs >>>
<headius> I can never remember the right one
<enebo[m]> yeah bit math sux
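For the record, >> is Java's arithmetic right shift (it copies the sign bit in from the left) and >>> is the logical shift (it fills with zeros); they only disagree when the sign bit is set, which is exactly where picking the wrong one bites.

    // >> (arithmetic) vs >>> (logical) right shift demo.
    public class ShiftDemo {
        public static void main(String[] args) {
            int x = -8;                        // binary ...11111000
            System.out.println(x >> 1);        // -4          (sign bit copied in)
            System.out.println(x >>> 1);       // 2147483644  (zero shifted in)

            int y = 8;                         // non-negative: no difference
            System.out.println(y >> 1);        // 4
            System.out.println(y >>> 1);       // 4
        }
    }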
subbu has joined #jruby
<enebo[m]> IO.gets has a fast path which will set CR but slow path does not seem to have that logic
<enebo[m]> Testing to see if it actually is hitting that path or not
<enebo[m]> OpenFile.getlineFast which I thought was setting cr is not providing any way for it to be anything other than 0
<enebo[m]> I suspect cr was int* in MRI
<enebo[m]> So slow path of getline has no cr logic and fast path is looking at a value which I think is always 0
<enebo[m]> We pass all the things so I am guessing cr is merely for optimization and not for correctness in these methods
<headius> ok so I finally got this tested on centos
<enebo[m]> yeah cr is passed &cr
<headius> the builds I did in CI against an earlier debian are still picking up newer glibc so it did not help
<headius> building on centos directly does produce binaries that work, but it's not reproducible unless I can fix it in CI
<enebo[m]> yuck
<enebo[m]> So I am really curious will fixing cr calculation on IO speed things up
<headius> it is frustrating that glibc does not have better support for building against a new version and still working on older ones
<headius> so I think we punt this and I'll try to come up with something else
<headius> I will open an issue against 9.4.3 so it doesn't get lost
<enebo[m]> I imagine in some methods it will just say "oh no cr I better walk it once"
<enebo[m]> but in each_line it just says "you are utf-8 great...utf-8 walking"
<headius> yeah if it is walking a lot of strings it doesn't need to, that would be overhead
<headius> I did see some calls to CR calculation stuff in profile but not up very high
<enebo[m]> so each_line is not doing that second walk but other methods are likely scanning or at least using CR for opts
<enebo[m]> but IO is not making a CR other than 0 so it just opts out all IO created data from playing
<enebo[m]> It might only be getline although I suspect that is a very common way of working with data
<enebo[m]> This may yield something
<enebo[m]> headius: so OpenFile can have a cr field which is only used for temporary cr calcs or each of these methods can make int[] cr = new int[] { 0 };
<enebo[m]> using a field prevents the box but it may be confusing
<enebo[m]> part of me feels like the JVM should properly be able to escape-analyze away single value arrays as out params but I guess it is not that simple :)
<headius> yeah would need to make sure any shared state is accessed under lock
<headius> but it is probably doing that
<enebo[m]> I believe all IO methods have lock() in places but yeah I think it maybe would be brittle since you would have to remember this
<enebo[m]> I could see accidentally doing it outside the lock and creating potentially very strange errors
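For context, the two shapes being weighed look roughly like this sketch (not OpenFile's real code): a throwaway int[1] out-parameter per call versus a reused scratch field that must only be touched while holding the stream's lock.

    // Sketch only of the two options: per-call int[1] box vs reused field.
    final class GetlineCrSketch {
        // Option 1: fresh single-element array per call, mirroring MRI's `int *cr`.
        // No shared state, but it allocates unless escape analysis removes the box.
        static int getline(byte[] buf, int[] crOut) {
            crOut[0] = computeCodeRange(buf);
            return buf.length;                 // placeholder for "bytes consumed"
        }

        // Option 2: a reused field avoids the allocation, but every read/write
        // of it has to happen under the stream lock (synchronized here stands
        // in for that) or the value can be clobbered by another thread.
        private int scratchCr;

        synchronized int getlineWithField(byte[] buf) {
            scratchCr = computeCodeRange(buf);
            return buf.length;
        }

        // Placeholder coderange calculation; the real constants and scanning
        // logic live elsewhere in JRuby.
        private static int computeCodeRange(byte[] buf) {
            for (byte b : buf) {
                if ((b & 0x80) != 0) return 2;
            }
            return 1;
        }
    }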
<enebo[m]> Part of me wonders how just nuking all CR out of IO code would help or hurt
<enebo[m]> in gets it is still doing lots of checks and logic but it is always 0
<enebo[m]> but if you accept IO does not do CR does removing all that logic speed up IO at the cost of other things being able to use it
<enebo[m]> This is a very long way of wondering how much cr costs relative to how much it helps
<enebo[m]> I predict it is a huge benefit for ASCII data but that is anecdotal
<enebo[m]> I am going to make a bench of IO.gets that calls something which uses CR
<headius> IO with transcoding should definitely be able to set CR since we know it has successfully converted
<headius> without that we don't really scan for chars so it would need one scan somewhere
<headius> so I also pushed this today: https://github.com/jruby/jruby/pull/7693
<headius> turns on color for backtraces and ir.print if running on a tty
<enebo[m]> nice
<headius> This Maven bug is getting annoying now. Who knows how long until GitHub updates to a fixed version, if the fix has even been released yet
<headius> I guess we just need to force all jobs to use a known working version for now
subbu has quit [Quit: Leaving]
<enebo[m]> interesting. you did something clever to avoid out param for cr but it is not hooked up to anything
<enebo[m]> you know pos will be int and you create an int using a long which loses the second half of the long
<enebo[m]> but second half is where you were stuffing CR data
<enebo[m]> so I will reaudit the methods in question but I think I can make smaller changes to keep CR properly updated
<enebo[m]> Although looking at this method I don't think it does anything other than figuring out CR (which is fine) but we are not setting it so it is just dead work. For fun I will delete this call and see if this is faster
<headius> Ha I must have gotten distracted
<enebo[m]> who knows. A lot of this is really old too
<enebo[m]> It is possible it was hooked up at some point
<headius> fwiw JVM should be able to eliminate temporary boxes if they inline and don't escape but it's not guaranteed of course
<headius> JVM as in Hotspot
<enebo[m]> Is a precondition that the place it escapes has been inlined right?
<enebo[m]> in theory my bench uses index on each line and if 7bit it should just byte index
<enebo[m]> I should probably do aref perhaps to just make it as dramatic as possible
<headius> yes must inline
<headius> and can't be used as part of any branching structure, unless they have improved that
<headius> we might get better escape analysis with a single-field carrier object also since arrays have bounds checks and things
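The single-field carrier variant would be something like the sketch below; whether Hotspot scalar-replaces it still depends on the allocation being inlined and never escaping, so it is a hope rather than a guarantee.

    // Sketch of a single-field carrier used as an "out" slot instead of int[1];
    // no bounds check on access, and scalar replacement can turn it into a
    // plain local when the allocation inlines and never escapes.
    final class IntRef {
        int value;
    }

    final class CarrierExample {
        static int parse(byte[] buf, IntRef crOut) {
            crOut.value = buf.length == 0 ? 0 : 1;   // placeholder coderange
            return buf.length;
        }

        static int caller(byte[] buf) {
            IntRef cr = new IntRef();                // candidate for scalar replacement
            int consumed = parse(buf, cr);
            return consumed + cr.value;
        }
    }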
<enebo[m]> this is just bit mathing it into the return result
<enebo[m]> using long instead of int
subbu has joined #jruby
<headius> yeah
<headius> when we can do that it's obviously ideal
<headius> I just did it for the chr fix
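The "bit math it into the return result" trick amounts to packing two ints into one long; a sketch of the shape (not the actual chr fix), which also shows where a plain (int) cast on the caller side silently drops the high half:

    // Sketch of packing (pos, cr) into one long return value to avoid an out
    // param. The caller has to unpack both halves; just casting the long to
    // int keeps pos but silently drops cr.
    final class PackedResult {
        static long pack(int pos, int cr) {
            // Mask pos so a negative value doesn't sign-extend over cr's half.
            return ((long) cr << 32) | (pos & 0xFFFFFFFFL);
        }

        static int pos(long packed) {
            return (int) packed;              // low 32 bits
        }

        static int cr(long packed) {
            return (int) (packed >>> 32);     // high 32 bits, zero-fill shift
        }
    }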
<headius> enebo: I will manually merge that irb fix and avoid his pom.xml juggling commits
<enebo[m]> ok
<enebo[m]> A bit noisy but CR hooked up is setting CR to 7bit and then using simpler index
<enebo[m]> The actual cost of opening and reading perhaps makes it noisier
<headius> seems like a good improvement though
<enebo[m]> well when it matters it will matter a lot like perhaps a more complicated regexp
<enebo[m]> I thought aref would be pretty visible and it is not so much but it is showing something
<enebo[m]> I am loading from a YAML test file we had, so many lines are shorter
<enebo[m]> I could best case this and make each line 10k long but who has data like that
<enebo[m]> Going to hook up another method I see not doing this and then look at the slow path the csv bench is using
<headius> ok
<headius> looks good to me
<enebo[m]> MRI slow path does not seem to bother with CR
<headius> maven downlevel for GHA jobs
<headius> enebo: I think something you fixed fixed this one: https://github.com/jruby/jruby/issues/7689
<enebo[m]> Yeah that is a duplicate
<headius> ah nice
<headius> ok
<enebo[m]> good news...I just made the OLD version of csv faster: https://gist.github.com/enebo/e40f4d59dd41438b51e6b96d784711a0
<headius> haha
<enebo[m]> oh and that is without the split opts
<enebo[m]> but I did not see much improvement with that so?
<headius> FWIW the old one is also fastercsv by JEG so it's not like the really old csv is winning or something
<enebo[m]> oh of course I didn't
<enebo[m]> It had no CR
<enebo[m]> yeah not much different but I think it is because much of it is UTF-8 and not 7 bit
<enebo[m]> In fact I think nearly 100% have mbc
<enebo[m]> I have some huge ascii .csv file with the same lines over and over
<headius> bleh, irb must have some updated tests in 1.4.2
<headius> enebo: I pushed those maven downgrades to master so things should work now
<headius> enebo: this might be a snakeyaml bug: https://github.com/jruby/jruby/issues/7698
<headius> or invalid yaml
<enebo[m]> hmm
<enebo[m]> some dangling reference?
<enebo[m]> but all YAML is utf-8
<enebo[m]> I believe it is including an encoding reference as an ivar for a string which only have \u char in it so maybe it just knows that is utf-8?
<enebo[m]> So it appears to be a tiny bit faster and almost breaks loose initially
<enebo[m]> Not sure what is up with that really fast first pass
<enebo[m]> headius: you should merge this one: https://github.com/jruby/joni/pull/61
<enebo[m]> looks obviously correct except if it is an encoding which has no valid java charset
<headius> I'll have a look at that