#jruby on 2023-02-04 — irc logs at libera.irclog.whitequark.org

2022-01-19 17:17 ChanServ changed the topic of #jruby to: Get 9.3.3.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

03:33 <byteit101[m]> hmm, lots of warning: unknown module org.jruby.dist specified to --add-opens

03:49 <headius> Oh?

04:10 <byteit101[m]> and spec:ruby hung

04:10 <byteit101[m]> fresh master checkout

04:11 <byteit101[m]> WARNING: Unknown module: org.jruby.dist specified to --add-opens... (full message at <https://libera.ems.host/_matrix/media/v3/download/libera.chat/032923e22252785e588e6703ad1402bebd70e2fe>)

04:11 <byteit101[m]> (etc)

04:12 <byteit101[m]> jdk 19 and ant 1.10.13

04:13 <byteit101[m]> windows 10

04:22 <headius> That should be the right module name when running as a ruby command line

04:27 <byteit101[m]> Yes, but it seems to be under java.base not org.jruby.dist for some reason

04:27 <byteit101[m]> command line is .\bin\jruby.bat -S rake spec:ruby

06:20 <headius> Hmm

06:20 <headius> Ok could be a problem

06:20 <headius> In any case it's showing up for you so open a bug

09:00 cgautam[m] has quit [Quit: You have been kicked for being idle]

14:01 <headius> so I'm looking at this CSV thing and the biggest impact is character transcoding... removing "UTF-8" from the CSV foreach call doubles performance on 9.4.1

14:02 <headius> an allocation trace shows the top item as Joni `Ptr` instances, which we use to track the current pointer into the input and output byte arrays... it's >32% of the allocations in this benchmark run

14:04 <headius> not Joni, sorry, jcodings transcoder Ptr

14:05 <headius> preallocating that once per OpenFile/IO helps a little bit

14:09 <headius> the other big allocator is Joni of course

15:10 <enebo[m]> headius: if you believe the rdoc on the old fastercsv it says it also transcodes to utf-8

15:13 <headius> What I know is that removing the explicit internal encoding UTF-8 from the reporter's script greatly improves the performance, but I have not validated that it still parses correctly

15:14 <enebo[m]> so just re-reading the older version's comments a bit more I think only the headers are processed as UTF-8 and the rest is agnostic

15:14 <enebo[m]> so that could be a reasonable explanation for some of it

15:15 <headius> So in this case he is causing it to also transcode the rest of the content which it did not do before

15:15 <headius> A theory

15:15 <enebo[m]> I did a time profile between the two versions of fastercsv and there is a lot more going on in the newer version in Ruby itself as well

15:15 <enebo[m]> yeah if so that would be an obvious chunk perhaps enough to say these are close enough

15:15 <headius> The one he linked is the old one you tried?

15:15 <enebo[m]> I opened this issue

15:15 <headius> Oh right duh

15:15 <enebo[m]> yeah that was the version of fastercsv that I noticed was 2x faster

15:16 <enebo[m]> 1.7 was 1.9 by default and it did support m17n but fastercsv is not explicitly transcoding and default encoding was ascii

15:17 <enebo[m]> In my notes from that other issue there was a sizeable diff in perf even without transcoding

15:17 <enebo[m]> but largely due to complexity of walking UTF-8 over byte indexing

15:18 <enebo[m]> https://gist.github.com/enebo/785b29fe428186aa4517d58d91fcc500

15:18 <headius> It turns out my layover in Newark is like 7 hours so I'm going to find a quiet place to work

15:18 <enebo[m]> yow

15:19 <enebo[m]> I spent a little time looking and I noticed things like String#split is pretty minor perf-wise on the older one but StringScanner#scan is pretty large in comparison

15:20 <enebo[m]> If this is in fact due to being UTF-8 then it should be simple enough to just make the test string UTF-8 and see the old one make split take much longer (which it may)

15:20 <enebo[m]> This profile is what made me optimize != for fixnum (which eliminated half those calls on the old version)

15:21 <enebo[m]> Not important but just an aside

15:24 <headius> Nice

15:27 <enebo[m]> So another weird aspect of the UTF-8 transcoding detail

15:27 <enebo[m]> It is UTF-8 by default in 9.4

15:27 <enebo[m]> It shouldn't be doing anything

15:42 <headius> The transcoding overhead I saw was during IO

15:46 <headius> I think the encoding being passed in is passed through to IO as the external and internal encodings, so it enables an extra layer of transcoding

15:47 <enebo[m]> hmm

16:23 <headius> JRuby 1.7 can't run on Java 17 because the modules got locked down

16:24 <enebo[m]> I have been using 8 for a reasonable comparison between the two

16:25 <enebo[m]> not specifically for this but in the other issue where the reporter is on 8

16:25 <enebo[m]> that is where I found the split optimizations but if I went up to 17 + indy I did get even more perf relative to MRI

16:26 <headius> I also noticed a lot of interpreter time in profile until I ran with jit threshold 0

16:29 <enebo[m]> This is in part because some toplevel methods do not run enough to JIT but are somewhat substantial

16:29 <enebo[m]> I also did the same thing which dropped down to 2.2s once that happened

16:30 <headius> yeah I figured it is some toplevel parser loop

16:30 <enebo[m]> old csv was much less Ruby

16:30 <headius> oy vey

16:30 <enebo[m]> One thing I also just realized is I did not check for exceptions

16:30 <enebo[m]> I did not spend much time on this to be fair but I did profile it

16:31 <headius> I just found a transcoding-related method used by this hot path that always allocates a StringBuilder to build an error message even if there was no error

16:31 <enebo[m]> huzzah

16:31 <headius> #2 in alloc profile
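
The always-allocated StringBuilder described above can be illustrated with a minimal sketch. All names here (TranscodeCheck, checkEager, checkLazy, buildersAllocated) are hypothetical and chosen for illustration; this is not the JRuby or jcodings code, only the general pattern of deferring error-message construction until an error is known to exist.

```java
// Hypothetical names throughout; not the actual JRuby/jcodings method.
final class TranscodeCheck {
    // Counts StringBuilder allocations, for illustration only.
    static int buildersAllocated = 0;

    // Eager version: builds the message before knowing whether it is needed.
    static String checkEager(boolean failed, String src, String dst) {
        StringBuilder sb = new StringBuilder();
        buildersAllocated++;
        sb.append("conversion failed from ").append(src).append(" to ").append(dst);
        return failed ? sb.toString() : null;
    }

    // Lazy version: the success path returns early and allocates nothing.
    static String checkLazy(boolean failed, String src, String dst) {
        if (!failed) return null;
        StringBuilder sb = new StringBuilder();
        buildersAllocated++;
        sb.append("conversion failed from ").append(src).append(" to ").append(dst);
        return sb.toString();
    }
}
```

On a hot path where errors are rare, the lazy form removes one allocation per call, which is the kind of item that shows up near the top of an allocation profile.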

16:31 <enebo[m]> hahah

16:32 <enebo[m]> So what is even happening in transcode...isn't this a UTF-8 IO source being transcoded to UTF-8?

16:32 <headius> I dunno, your example had ISO-8859-1 in that call and if I remove it entirely it complains about the file not being UTF-8

16:33 <enebo[m]> ah yeah. the funny part here is I just grepped to see if I had a csv bench and this happened to be in my snippets dir

16:33 <enebo[m]> Perhaps the encoding: bit is strange in real life...or not

16:34 <enebo[m]> I guess the input is opened as 8 bit ascii so it cannot just use UTF-8

16:34 <enebo[m]> In this case perhaps this is a weird bench and it may just come down to 1.7 not really transcoding stuff

16:35 <headius> yeah could be

16:35 <headius> but I have found some improvements nonetheless

16:35 <enebo[m]> 1.7 is 1.9.x so it did supposedly handle transcoding but I do not think the layer was really working until later in 2.0 support

16:36 <enebo[m]> You said if you remove encoding: it gets a lot closer?

16:36 <enebo[m]> MRI between the two versions is like 5% slower or something around there

16:36 <enebo[m]> so it does slow down between the two versions but it is so close

16:36 <headius> if I remove encoding: altogether it errors

16:37 <enebo[m]> This was another reason why I figured this was a reasonable issue since we tank a lot more

16:37 <enebo[m]> ah it is real data and is really some ascii 8 bit

16:38 <headius> ok with my two fixes alloc profile is now all joni and strings

16:38 <enebo[m]> lol emacs is very unhappy selecting data in that file

16:38 <headius> 70GB of int[] in one minute sampled profile

16:39 <enebo[m]> joni needs some threadlocal temp buff love?

16:39 <enebo[m]> or is this all in-flight data which needs their own string

16:39 <headius> this is the machine workspace

16:39 <enebo[m]> oh joni

16:39 <headius> it could be cached
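
The "threadlocal temp buff" and "it could be cached" ideas above could look roughly like the following. This is a sketch under assumptions: Scratch and acquire are hypothetical names, and nothing here reflects how Joni actually manages its workspace. It only shows the general technique of keeping one reusable int[] per thread and growing it on demand.

```java
// Hypothetical per-thread scratch buffer; not Joni code.
final class Scratch {
    private static final ThreadLocal<int[]> BUFFER =
            ThreadLocal.withInitial(() -> new int[16]);

    // Returns this thread's buffer, growing it if it is shorter than minLength.
    // Contents are NOT cleared between calls; callers must not rely on them.
    static int[] acquire(int minLength) {
        int[] buf = BUFFER.get();
        if (buf.length < minLength) {
            buf = new int[Math.max(minLength, buf.length * 2)];
            BUFFER.set(buf);
        }
        return buf;
    }
}
```

One caveat with this approach: the buffer is only safe for data that does not outlive the call and for code that is not re-entered on the same thread while the buffer is in use.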

16:40 <enebo[m]> my brain read it as jcodings even though I even wrote it as joni

16:40 <enebo[m]> many many of the fields in joni are int

16:40 <enebo[m]> we just need to stuff them all in an array and unsafe write them

16:40 <headius> ah this is from Region alloc

16:41 <headius> so results of match

16:41 <enebo[m]> yeah region feels like it could mostly be [1, 2]

16:41 <headius> pretty sure this already has an optimization when there's only one region so not sure how much we can improve this

16:42 <enebo[m]> The problem of course is that some amount of the time region is also two arrays of values for multiple matches

16:42 <enebo[m]> but many many regexps only have a single begin,end

16:42 <headius> actually it does not

16:42 <enebo[m]> yeah it makes two arrays all the time

16:42 <enebo[m]> I also think if nothing else this could just become one array as well

16:43 <headius> ugh why are these fields public

16:43 <headius> yeah that would be good too

16:43 <enebo[m]> heh

16:43 <enebo[m]> single array is doable without breaking an API

16:43 <enebo[m]> but yeah on matchless searches it should use an array with no fields

16:44 <enebo[m]> err just one beg/end

16:44 <enebo[m]> I believe exposing beg/end as public fields will require a major rev and those are pervasively accessed

16:45 <headius> yikes

16:45 <headius> they sure are

16:45 <enebo[m]> I believe the motivation was to "pre inline" what would have been a simple method

16:46 <enebo[m]> and there is a lot of fretting about whether everything is monomorphic

16:46 <headius> so we can't make any improvement here without a big change in JRuby

16:46 <enebo[m]> but the methods have more if stmts here and there to have a single version

16:47 <enebo[m]> not until they allow us to impl [] methods

16:47 <enebo[m]> I do think we could do this largely mechanically if we added signature for it

16:47 <headius> I'm gonna start that process

16:47 <enebo[m]> hahah

16:48 <headius> yeah first step add the accessors

16:48 <headius> deprecate direct field access and release

16:48 <enebo[m]> Is my brain messing with me...isn't there a zero method for primitive arrays?

16:48 <headius> you've got Ruby brain

16:48 <enebo[m]> or Rust brain maybe

16:48 <headius> a zero method?

16:48 <enebo[m]> Rust has ETOOMANYFUNCTIONS

16:49 <enebo[m]> reset all elements in an array to a value

16:50 <headius> ah nope nothing like that

16:50 <headius> Arrays.fill(ary, 0)

16:50 <enebo[m]> aha

16:50 <enebo[m]> yeah was what I was thinking of
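
For reference, a minimal use of the java.util.Arrays.fill call mentioned above. FillDemo and reset are hypothetical names used only for this illustration.

```java
import java.util.Arrays;

// Hypothetical helper showing Arrays.fill on a primitive array.
final class FillDemo {
    // Resets every element of the given array to zero, in place.
    static int[] reset(int[] values) {
        Arrays.fill(values, 0);
        return values;
    }
}
```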

16:51 <enebo[m]> If this had had an API for beg/end it could have combined the arrays and removed the length field

16:51 <headius> yeah that is an easy change once encapsulated

16:51 <enebo[m]> but for things which don't care about more than a single beg/end then specialization would be needed

16:52 <enebo[m]> there is also some historyRoot which I suspect is not normally used

16:52 <headius> yeah either a branch in the accessors or a second Region class
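
The "second Region class" option discussed above might be sketched as follows. This is an assumption-laden illustration, not Joni's API: SketchRegion, SingleSketchRegion, MultiSketchRegion, and every method name are hypothetical. It shows beg/end hidden behind accessors, a single interleaved int[] for the multi-group case, and a specialization with no arrays for the single beg/end case.

```java
import java.util.Arrays;

// Hypothetical sketch; names are illustrative and not taken from Joni.
abstract class SketchRegion {
    abstract int numRegs();
    abstract int getBeg(int index);
    abstract int getEnd(int index);
    abstract void set(int index, int beg, int end);

    // Picks the specialized form when only one beg/end pair is needed.
    static SketchRegion create(int numRegs) {
        return numRegs == 1 ? new SingleSketchRegion() : new MultiSketchRegion(numRegs);
    }
}

// One beg/end pair stored in plain fields, so no array is allocated.
final class SingleSketchRegion extends SketchRegion {
    private int beg = -1;
    private int end = -1;

    @Override int numRegs() { return 1; }
    @Override int getBeg(int index) { check(index); return beg; }
    @Override int getEnd(int index) { check(index); return end; }
    @Override void set(int index, int beg, int end) {
        check(index);
        this.beg = beg;
        this.end = end;
    }

    private static void check(int index) {
        if (index != 0) throw new IndexOutOfBoundsException("index: " + index);
    }
}

// Many pairs stored in one interleaved array: [beg0, end0, beg1, end1, ...].
final class MultiSketchRegion extends SketchRegion {
    private final int[] begEnd;

    MultiSketchRegion(int numRegs) {
        begEnd = new int[numRegs * 2];
        Arrays.fill(begEnd, -1); // -1 marks "no match" for each slot
    }

    @Override int numRegs() { return begEnd.length / 2; }
    @Override int getBeg(int index) { return begEnd[index * 2]; }
    @Override int getEnd(int index) { return begEnd[index * 2 + 1]; }
    @Override void set(int index, int beg, int end) {
        begEnd[index * 2] = beg;
        begEnd[index * 2 + 1] = end;
    }
}
```

With callers going through accessors, the storage layout can change later without touching them, which is the point made about the change being easy "once encapsulated". The trade-off raised in the discussion still applies: two implementations mean call sites are no longer guaranteed monomorphic.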

16:52 <enebo[m]> ah yeah it is a feature

16:52 <enebo[m]> another reason to specialize

16:53 <enebo[m]> I guess having a single impl has its benefits

16:53 <enebo[m]> null fields with one impl vs not

16:53 <enebo[m]> I also would like to see region be thread local

16:53 <enebo[m]> maybe

17:23 <headius> looks like some joni releases went out without milestones?

17:24 <headius> https://github.com/jruby/joni/pull/58

17:26 <headius> huh we never set up a CI job for joni so I added one

17:30 <enebo[m]> headius: ah I did not realize we even had milestones for joni

17:54 <headius> yeah they don't get used much but they're there

18:00 <enebo[m]> ok thanks for doing them. I probably will do that next time 😀

18:02 <headius> https://github.com/jruby/jruby/pull/7619

18:03 <headius> initial change to Region accessors is there

18:03 <headius> need to release 2.1.47 with the deprecated fields and then 2.2 for their privatization

18:06 <enebo[m]> headius: heh...wtf is Ptr :)

18:06 <headius> you mind if I move that forward

18:06 <enebo[m]> I don't think so. It is surprising it is that few

18:06 <headius> it's an in/out var for byte index

18:06 <enebo[m]> ah so it needs boxing anyways

18:06 <headius> yeah there may be a more efficient way but smarter use of Ptr instances is best we can do for now really

18:07 <headius> offset changes could possibly be returned in the EConvResult

18:07 <headius> you'd have to manually adjust though
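
The "in/out var for byte index" idea above can be sketched like this. IntRef and CopyStep are hypothetical stand-ins, not the jcodings Ptr class or its converter API. The sketch only shows why such a holder exists (Java has no pass-by-reference for int) and how one pair of holders can be preallocated and reused across calls instead of allocated per call.

```java
// Hypothetical mutable int holder; a stand-in for an in/out byte index.
final class IntRef {
    int p;
}

final class CopyStep {
    // Copies up to max bytes from in to out, advancing both positions.
    // Returns the number of bytes copied in this step.
    static int copy(byte[] in, IntRef inPos, byte[] out, IntRef outPos, int max) {
        int n = Math.min(max, Math.min(in.length - inPos.p, out.length - outPos.p));
        System.arraycopy(in, inPos.p, out, outPos.p, n);
        inPos.p += n;   // the caller observes the new input position
        outPos.p += n;  // and the new output position
        return n;
    }
}
```

A caller would create the two IntRef instances once (for example, once per open file) and pass the same instances to every step, which matches the "preallocating that once per OpenFile/IO" change mentioned earlier in the log.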

18:07 <enebo[m]> I imagine internally in joni the beg/end will be much larger change set

18:08 <headius> oops

18:08 <headius> this commit doesn't replace any of the reads

18:08 <headius> or the other field accesses

18:08 <headius> pushed too early

18:08 <enebo[m]> yeah I thought there were going to be a ton

18:08 <headius> it's not that big actually

18:08 <headius> in jojni

18:08 <headius> jonjninionni

18:08 <enebo[m]> ah

18:08 <enebo[m]> I suppose I am just magnifying how much region is accessed/stored

18:09 <enebo[m]> string/regexp are most in JRuby itself although I know we leverage it in some other classes

18:10 <enebo[m]> lopex has entered the chat :)

18:11 <lopex[m]> numbers

18:11 <enebo[m]> I am likely done hacking today once I eat lunch but I almost have my split stuff cleaned up

18:11 <enebo[m]> I realize simpleRegexp detection is fairly complicated

18:12 <enebo[m]> e.g. making regexp into string splits

18:13 <enebo[m]> but my split cleanup has been pretty fun. The MRI port had us doing a lot of things I don't think mattered so I think it is looking a lot easier to understand

18:13 <enebo[m]> passing an 'i' which would only be 0 or 1 and depended on lim determining a boolean limit which would then interact with the 'i'.

18:14 <enebo[m]> I am still not convinced MRI needs that i either but I killed limit and i from any callers

18:17 <enebo[m]> The only obvious point of i I can find is 1 also signifies an explicit limit was passed but it does not use the value for that purpose

18:27 <headius> Yeah sounds good

18:27 <headius> I will roll with this Joni change before my flight and then review remaining 9.4.1 issues

18:27 <headius> My random fix seems nearly there but something is hanging in MRI core suite with it

18:28 <enebo[m]> ok

19:04 <headius> enebo: could you release joni for me pleeeeease

19:04 <headius> apparently I never set up my gpg key on this machine and don't have it handy

19:38 <headius> nevermind I managed

19:55 <headius> hmm

19:55 <headius> those MRI core CI runs are hanging without my branch

19:58 <headius> shite

19:58 <headius> strscan gem accesses Region fields

19:59 <headius> I guess it makes sense but this means updating strscan will break on 9.4.0 and older strscan will not work on 9.4.1

20:01 <headius> well, this will just be a failure until I can address it and we should release 9.4.1 with joni 2.1.47

20:02 <headius> I will yank this commit out of the optimization PR

22:59 <headius> I pushed a PR with the new SingleRegion impl

23:00 <headius> going dark for flight but I might pop back in later