<enebo[m]>
I will update this gist as I get more results
<headius>
try update and see if MRI is even close to us
<headius>
for me it is like two orders of magnitude slower
<enebo[m]>
ok I am running indy on same bench for us atm
<headius>
I am trying to get 2.7 installed but it is not compatible with openssl 3
<headius>
lord how do people deal with this env
<enebo[m]>
JRuby updated in gist
<enebo[m]>
HAHAHA
<headius>
you see how the rehearsals just slow down?
<enebo[m]>
it is slowing down in real time in rehearsal
<enebo[m]>
Something is growing here
<enebo[m]>
This update I think is a weird form
<enebo[m]>
because it is a hash update
<enebo[m]>
and I don't think this is how people normally update data
<headius>
hmm
<headius>
maybe?
<enebo[m]>
I have to wonder if this is just not finalizing a commit
<headius>
I tried putting all the updates in transactions and it did not change
<headius>
transaction per iter
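For reference, a sketch of the transaction-per-iteration variant headius describes, assuming a benchmark-ips harness and the BenchRecord model from the gist (names are illustrative):
```ruby
require "benchmark/ips"

record = BenchRecord.create.reload
Benchmark.ips do |x|
  x.report("update in txn") do
    # Wrap each update in its own transaction; per the discussion above,
    # this did not change the progressive slowdown on MRI.
    BenchRecord.transaction do
      record.update(a_string: "hello")
    end
  end
end
```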
<headius>
landing now but I will be working on this until I get something I can use
<headius>
2.7.6 behaves the same
<headius>
bbiab
<enebo[m]>
headius: I believe I have part of the answer. I moved the ```record = BenchRecord.create.reload``` into the block and that obviously slowed down the bench a lot but I no longer see any slowdown
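A minimal sketch of the change being described, with illustrative field names (the gist itself is not reproduced here):
```ruby
require "benchmark/ips"

# Original shape: one record created once and updated repeatedly with
# the same values, which got progressively slower on MRI.
record = BenchRecord.create.reload
Benchmark.ips do |x|
  x.report("update") { record.update(a_string: "hello") }
end

# Moving the create into the block: slower overall (an extra INSERT and
# SELECT per iteration) but the progressive slowdown disappears.
Benchmark.ips do |x|
  x.report("update") do
    record = BenchRecord.create.reload
    record.update(a_string: "hello")
  end
end
```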
<enebo[m]>
So what I am pretty sure is happening is JRuby is doing an implicit commit on update while MRI is saving all these changes and then going through them all some day
<enebo[m]>
so the perf difference is a fucking ginormous changeset which is never applied
<enebo[m]>
kares: So for whatever reason .update(hash) is not committing and it is just growing some massive set of changes
<enebo[m]>
I changed it to ```record.send(field + "=", value); record.save!``` and we got reasonable behavior from MRI
<enebo[m]>
Ultimately I think this bench should be ```record.a_boolean = true; record.save!``` with one measure per field
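A sketch of the variants mentioned here, assuming `field` and `value` come from the benchmark's field list:
```ruby
# Hash form that exhibited the progressive slowdown on MRI:
record.update(field => value)

# Per-field form that behaved reasonably:
record.send(field + "=", value)
record.save!

# Hard-coded per field, one measure each, as suggested:
record.a_boolean = true
record.save!
```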
<headius>
I will try doing the save with the hash version of the update and hopefully that will also fix it
<headius>
We really don't need these benchmarks to be quite so meta so we could just hard code them to the field names
<headius>
Or for Pete's sake, aren't there good benchmarks out there for active record yet?
<headius>
yeah it is something weird with the hash update version
<headius>
I wonder why we don't slow down
<headius>
enebo: save! isn't needed to fix the slowdown, just switching to the send version eliminates the problem
<headius>
but of course it is not saving, and save cuts the IPS way down
<headius>
we are still comfortably 2x faster in both cases but who knows if this bench is still valid
<headius>
it's doing something
subbu has joined #jruby
subbu has quit [Ping timeout: 260 seconds]
subbu has joined #jruby
subbu has quit [Quit: Leaving]
<kares[m]>
<enebo[m]> "kares: So whatever reason ...." <- hmm I guess it detects no changed are made - the previous code with multiple record entries tried to get smart about this
<kares[m]>
<enebo[m]> "I changed it to record.send(..." <- that should be the same as update! field -> value ... which is pretty much update field -> value but with a raise if validation fails
<kares[m]>
guess the best approach here would be to have enough records up front to update ...
<kares[m]>
that or making sure the value always changes - but that could interfere as to how expensive the op to generate the next value is
<kares[m]>
think it's best to rely on the create use-case for AR ... the update use-case is still relevant in terms of Ruby perf -> most ops end up as no-ops in terms of DB operations
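One way to make sure the value always changes, as suggested above; note the string interpolation adds a small per-iteration cost (a sketch, not taken from the gist):
```ruby
require "benchmark/ips"

record = BenchRecord.create.reload
i = 0
Benchmark.ips do |x|
  x.report("update (always dirty)") do
    # A different value every iteration, so dirty tracking always sees
    # a change and every update becomes a real DB write.
    record.update(a_string: "value #{i += 1}")
  end
end
```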
<headius>
I'm going to hit up a few folks this week and see if I can figure out what they're using to optimize active record on the rails side
<headius>
There's got to be better benchmarks that are known to work properly and test these things
<headius>
For what it's worth the update benchmark seems to be the only problematic one
<kares[m]>
yeah I looked at some Rails benchmarks but the update code isn't better
<headius>
It's just the one I'm most interested in because it would create the fewest objects
<kares[m]>
yeah - one thing we could do is disable the dirty tracking ... it would mess with AR internals a bit but would make sure each change is considered a DB operation
<headius>
is that something we are doing differently than the CRuby code?
<headius>
The update numbers might be affected by that but I am seeing a similar 2:1 ratio for selects too... we seem to be doing very well on overall AR perf
<headius>
the end to end view numbers are not 2x but still comfortably ahead
<kares[m]>
this enforces the update operation and numbers should be more realistic ...
<kares[m]>
JRuby is faster than MRI, except for the `update!(all_fields)` case, which is interesting.
<kares[m]>
indy is almost double the speed of non-indy except again the `update(all_fields)` case, which seems to degrade compared to non-indy (must be a lot of invalidation going on or what not)
<kares[m]>
enebo: not sure how the `record.send(field + '=', value); save!` trick worked - would have expected that to not issue an update if the record stayed the same
<kares[m]>
the PR I mentioned above achieves the forced DB update with some dirty AR tricks
<kares[m]>
literally dirty tricks ... so I guess it's dirty dirty AR tricks 😉
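A guess at what such a dirty trick might look like, using Active Model's public dirty-tracking API (whether the PR does exactly this is not shown in this log):
```ruby
# Mark the attribute as changed even though the value is identical,
# so the following save issues a real UPDATE.
record.a_string_will_change!
record.save!
```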
<enebo[m]>
kares: so if MRI realizes it is the same value it saves it somewhere and keeps growing something?
<enebo[m]>
That is what I am confused about. Why would it get progressively slower with update unless it was saving that state into something which is continuously growing?
<enebo[m]>
My wife said the same thing to me this morning: detecting no change may mean no operation, but that does not explain the continual slowdown
<enebo[m]>
and save! might not be changing the db but without save! the results were millions of i/s, which is consistent with not updating. save! brought it back into the ballpark of the other results (although maybe a little higher)
<enebo[m]>
I guess one other thing she said was record.field = also performs validation but I would assume update{,!} does too
<headius>
good morning
<headius>
kares: the numbers in that PR still slow down as they progress and are an order of magnitude slower than the JRuby results for most of them
<enebo[m]>
headius: Since you must have run that, can you look at the memory of that process?
<headius>
the results with enebo's send seem more naturally 2x or so and don't slow down linearly
<enebo[m]>
I never bothered because the behavior seems like it must just be a growing array
<headius>
enebo: that's a thought... I can reset your changes locally and see about the memory
<enebo[m]>
I cannot really think of anything else which explains a slow down like that
<enebo[m]>
some data structure is just growing and never cleaning up
<enebo[m]>
headius: so update! is doing it as well?
<enebo[m]>
I had thought I tried that last night but I don't recall now
<kares[m]>
<enebo[m]> "My wife said the same thing to..." <- nice!
<headius>
I have not run the numbers in that PR but you can see them there
<headius>
the warmup phase has each case slower than the previous
<kares[m]>
<enebo[m]> "kares: so if MRI realizes it..." <- which one are you confused about here the `update(all_attrs)` seems like a bug to me - not sure what's going on underneath, the others I can explain 😉
<kares[m]>
a Rails bug of some kind - haven't really looked, just assumed a growing internal Hash/Array somewhere would cause a linear slowdown
<enebo[m]>
yeah it must be a bug
<enebo[m]>
record.field = value is insanely fast without save so I assume that is not committing or not doing anything because the value is the same
<kares[m]>
record.field = value is just an assignment ... there's update_attribute if you intend to also save
<enebo[m]>
It more or less is method call fast
<kares[m]>
even record.field = value; record.save! is going to be fast if you run it with the same value twice - only the first save will happen
<enebo[m]>
ok that is what my wife said at first...you can just use update_attribute
<kares[m]>
AR tracks the values and if they did not change in the record object save! is a noop
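A small illustration of the no-op behavior described here, assuming a persisted record whose `a_string` column is already "same":
```ruby
record.a_string = "same"      # value matches what is in the DB
record.save!                  # dirty tracking sees no change: no UPDATE issued

record.a_string = "different" # value actually changes
record.save!                  # UPDATE is issued this time
```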
<enebo[m]>
but it is much much slower than a normal call
<enebo[m]>
So I guess that is just value tracking overhead
<kares[m]>
which one the update_attribute or the save?
<enebo[m]>
the save!
<enebo[m]>
Adding record.save! reduces results from 1M i/s to about 1600 i/s
<enebo[m]>
which is close to other results which are accessing the database
<enebo[m]>
but it may not actually be saving anything
<enebo[m]>
I just observed it is close to a db operation speed so I assumed it was
<kares[m]>
yep that's the bug - underneath `r.update(field => value)` and `r.field = value; r.save` are doing the same thing
<headius>
wow, graalvm community edition download for linux is 422MB