#jruby on 2022-09-21 — irc logs at libera.irclog.whitequark.org

2022-01-19 17:17 ChanServ changed the topic of #jruby to: Get 9.3.3.0! http://jruby.org/ | http://wiki.jruby.org | http://logs.jruby.org/jruby/ | http://bugs.jruby.org | Paste at http://gist.github.com

01:19 adam12 has joined #jruby

01:24 sapphire1896[m] has joined #jruby

01:28 sapphire1896[m] has left #jruby [#jruby]

04:50 sagax has joined #jruby

06:12 curious-antelope has joined #jruby

06:12 curious-antelope has left #jruby [#jruby]

06:47 smudge-the-cat has joined #jruby

06:47 smudge-the-cat has left #jruby [#jruby]

07:49 smudge-the-cat has joined #jruby

07:49 smudge-the-cat has left #jruby [#jruby]

10:07 brometeo[m] has joined #jruby

15:26 <headius> transfer spec only seems to hang in a full run... still trying to narrow it down

15:28 <enebo[m]> https://github.com/jruby/jruby/issues/7367

15:28 <enebo[m]> This is a head scratcher. I know this is on windows and windows is weird with IO already but why would JIT cause this?

15:30 <headius> oh I glanced at it but did not realize it was only happening in jit

15:32 <headius> yup I have no idwa

15:32 <headius> idea

15:33 <enebo[m]> I marked it for 9.3.9.0 since it is a logstash bug

15:33 <headius> I need to get a windows VM up and going again on new machine

15:35 <headius> aha I was not running right specs... Fiber#transfer is in stdlib 'fiber' technically

15:50 <headius> enebo: I can think of no reason why jit would cause that issue... nothing there should be compiled in and it's all done via method calls and constant lookups

15:50 <headius> need to set a breakpoint in StringIO.new at the point it is called from jitted code and see what's up

15:54 <headius> woot I think I have a repro

15:54 <headius> combination of some weird Fiber#resume that is supposed to error (but doesn't) and trying to transfer from the root fiber... seems like some fiber data gets corrupted

16:06 <enebo[m]> headius: I asked a question...why the ENV["test"] = ""

16:06 <headius> Yeah I don't know what that is either and it shouldn't affect anything

16:06 <headius> Are you able to reproduce this?

16:07 <enebo[m]> I need to try it still

16:07 <enebo[m]> I am just confused out of the gate why JIT would cause this

16:07 <enebo[m]> I am not surprised something broke on windows involving IO but the fact JIT causes it makes no sense to me

16:08 <headius> Nor to me

16:20 <enebo[m]> heh...ENV["test"] = "" is the issue...now to figure out why

16:20 <headius> Wut

16:21 <headius> Is it resetting the external encoding somehow?

16:22 <enebo[m]> but only when JIT :)

16:22 <headius> So in that case the jit would compile the whole script, embedding the encodings of those strings, and if that's the old encoding and ENV is somehow altering the global external encoding, that might cause it to reset

16:23 <headius> Oh but the sequence is wrong

16:23 <headius> The updated external encoding happens after that line

16:23 <headius> 🤷‍♂️

16:23 <enebo[m]> yeah fun one

16:24 <headius> Check that the default external encoding has actually been updated after that line

16:26 <enebo[m]> yeah I am just tracing through Encoding.default_external =

16:27 <enebo[m]> This would change it to whatever is UTF_8

16:29 <headius> Yeah I still don't have a good theory. Even if ENV does mess with the external encoding, the next line should reset it

16:29 <enebo[m]> Ok a new clue

16:30 <enebo[m]> I make "" into a string with forced encoding of UTF-8 and it works

16:32 <enebo[m]> heh...progress but still no clue

16:40 <headius> I don't see anything in ENV logic that would mess with default_external

16:45 <enebo[m]> If I move the ENV to the top it does not change the behavior

16:45 <enebo[m]> It is sort of like if ENV is used the default_external is cached in the JIT

16:58 <headius> enebo: some day I'll learn to test old versions first... this hanging transfer spec also hands on 9.3

16:58 <headius> likely never worked quite right but was hard erroring due to missing features when running under 3.0 mode

16:59 <headius> s/hands/hangs/

17:02 <headius> the problem is a bit tricky... if I am getting this right, main thread resumes fiber1, which then resumes the main fiber... does not detect it is in the process of resuming so it goes ahead and transfers control back... but now the main fiber looks like it is supposed to return to fiber1, which is dead, and a subsequent transfer from main fiber to a new fiber hangs waiting for that dead fiber to pick it up

17:02 <headius> it is supposed to error when fiber1 tries to resume main fiber but because main fiber does not have anyone waiting on it, it looks like it is resumable

17:03 <headius> so it corrupts the main fiber

17:03 <headius> basically incorrectly says "this fiber is being waited on by fiber1, so when it transfers, that's the fiber you should return to"

17:05 <headius> may be simple enough to just always error if trying to resume root fiber from a child fiber, since there's no way you could be doing that unless main fiber were already resuming

17:05 <enebo[m]> headius: heh. yeah re not testing older versions

17:29 <headius> enebo: I pushed a fix for this fiber thing that just hard errors if a fiber tries to resume the root fiber

17:30 <headius> that spec now passes and does not muck up the root fiber's state, so the subsequent transfer spec also runs to completion and passes

17:31 <headius> may need an overhaul of this in the shorter term but that's a bigger job... I feel like I finally understand what transfer is (pass control to the target fiber, but have it return to the fiber waiting on me)

17:32 <headius> enebo: could that stringio bug be modifying some global blank string to have that encoding?

17:36 <headius> I think that is the problem

17:36 <headius> https://github.com/jruby/jruby/blob/8ed0a2b109f5064341aa2bf612786eceb4811ec8/core/src/main/java/org/jruby/RubyGlobal.java#L756

17:37 <headius> JIT mode will cache a single instance of ByteList for all strings of the same content in a given file

17:37 <headius> strDup dups the RubyString object but shares the ByteList

17:38 <headius> it may be using a global blank ByteList

17:38 <headius> we probably should have a ByteList subtype or sibling type that is fully immutable for this and frozen string cases, so there's no chance to mutate it accidentally

17:49 <enebo[m]> OMG

17:50 <headius> JIT uses a global static blank bytelist for empty strings

17:50 <enebo[m]> This makes sense...let me try a non-empty string but this string bank maybe is broken?

17:51 <headius> it does mark it shared but getByteList does not unshare so we end up modifying the global blank bytelist

17:51 <headius> so strDup.getByteList just uses the same shared bytelist again

17:52 <headius> and then we set default Locale Encoding into it

17:52 <headius> if that line I linked above is modified to unshare or explicitly dups the bytelist it will be fixed I bet

17:53 <headius> so yeah we need an immutable bytelist type, which would also potentially optimize frozen strings better too

17:53 <headius> supertype bytelist, subtypes mutable and immutable

17:53 <headius> only bimorphic so even older hotspot should still be able to optimize it

17:54 <enebo[m]> I will remove sharing

17:54 <headius> it should only require making all methods call through getters to get ByteList innards, and then we error if setters are used on immutable one

17:55 <enebo[m]> immutability would have prevented this so that is a good idea

17:55 <enebo[m]> but not a today thing

17:55 <headius> heh no

17:55 <enebo[m]> this not sharing on env values is not really a big deal

17:56 <enebo[m]> if you are constantly setting env then you have at least two problems

17:57 <enebo[m]> hmm it may be used for a bit more but I will test this

18:10 <headius> did a quick experiment in refactoring toward a mutable bytelist and it's probably a day's work to split into mutable and immutable

18:11 <headius> main problem is we have a lot of places that `new ByteList` that would need to go through a factory method to get mutable or immutable appropriately

18:11 <headius> or we can do the weak hack of just introducing ImmutableByteList that overrides setters to error rather than having final fields

18:11 <headius> we lose some benefit of truly final fields but it would catch mutation bugs

18:17 <enebo[m]> download bellsoft liberica

18:18 <enebo[m]> my windows machine has ideas

18:19 <enebo[m]> and those ideas are slow and confusing

18:20 <headius> ha

18:21 <headius> not just a simple refactoring... we arraycopy directly into the array and stuff so each method needs to be reviewed

18:21 <headius> we'd need to go through setter and mutator methods for any mutation so that a subtype can error

18:21 <headius> so yeah it is doable but a project

18:26 <enebo[m]> headius: the fix for today is going to just be str.getByteList().shallowDup() right?

18:26 <enebo[m]> we can still share the backing array of whatever byte[] we are making

18:27 <headius> that should be enough since we only mutate the encoding

18:27 <headius> if we get around to this ByteList rework, you would do dupWithEncoding and have more fluent-style mechanisms for getting a new ByteList from an old one that knows about mutability

18:27 <enebo[m]> My windows box is just not loading JRuby no matter what I try. I don't want to spend 10x the time loading the IDE when I can just fix this on the linux side (I will edit with vi to test though)

18:28 <headius> not loading JRuby?

18:28 <headius> like IntelliJ not loading the project?

18:28 <enebo[m]> ultimately I feel we should consider encoding to be part of sharing

18:28 <headius> well the bigger issue is that ByteList has no knowledge of its shared status

18:28 <headius> this all plays into ByteList being too dumb and all the sharing and code range logic living outside of it

18:29 <enebo[m]> yeah I see

18:30 <enebo[m]> ok well I find the concept of bytelist as something to be redone. It is {bytes, index, size, encoding} but it is clear in how we use it that this tuple is not really enough without ripping all the values out of it

18:31 <enebo[m]> or at least we have had hundreds of bugs from this not quite enough and no abstraction

18:31 <headius> yeah this is a classic bug

18:32 <enebo[m]> yep. It tends to be 'begin' but this is the same sort of issue

18:32 <headius> we need to bake frozen/immutable into ByteList all the way down

18:32 <enebo[m]> yeah

18:32 <enebo[m]> we will get that done when we port over to Kotlin

18:32 <headius> you freeze a string, it flips to a frozen ByteList... if you dup that string for mutation it creates a new mutable ByteList, but if you dup it into a new frozen string it can share

18:32 <headius> hah

18:33 <headius> yeah once we hire a team of interns to do all the work for us

18:33 <enebo[m]> I have wondered how much a language with immutability builtin can eliminate checks in its compiled code

18:33 <enebo[m]> Rust eliminates a ton so I am joking about Kotlin but it does know about immutable values

19:05 <enebo[m]> interesting non of that code is even being hit

19:05 <enebo[m]> That has to be the base problem but that specific code is not being hit by the ENV snippet

19:07 <enebo[m]> Hmm no @JRubyMethod for op_aset until RubyHash

19:10 <headius> it is overridden

19:10 <headius> doesn't matter if it is @JRubyMethod if it overrides the op_aset from RubyHash

19:10 <enebo[m]> yeah

19:10 <enebo[m]> It is not being called

19:10 <enebo[m]> so why would that be?

19:11 <headius> op_aset is overridden by StringOnlyRubyHash, are you saying that is not being called?

19:11 <enebo[m]> I have a print in it and even in newString and nothing is printing

19:11 <headius> that calls case_aware_op_aset which calls newName which does the encoding juggling

19:12 <enebo[m]> I have prints in all of those

19:12 <headius> on Windows?

19:12 <enebo[m]> on windows yeah

19:12 <enebo[m]> I am using vi since I cannot get idea to load the project but I can see maven compiling the file I am changing

19:12 <headius> you sure it'

19:12 <enebo[m]> I can also still see the error go away if I don't jit

19:13 <enebo[m]> no

19:13 <headius> it's running the right one?

19:13 <headius> I don't see how it could not be calling this

19:13 <enebo[m]> :)

19:13 <enebo[m]> Maybe I am calling installed one

19:13 <enebo[m]> I will double check

19:13 <headius> []= is op_aset, overridden in that env hash

19:13 <enebo[m]> it wouldn't be the firs time

19:13 <enebo[m]> yeah

19:13 <enebo[m]> I mean I know that

19:13 <headius> yeah

19:13 <headius> I suspect pebkac

19:13 <headius> otherwise I have no idea

19:14 <enebo[m]> haha. I switched about 30 tries beforehand

19:14 <enebo[m]> I hate programming

19:14 <headius> hahah

19:14 <enebo[m]> I did a new command line and just typed it because I am quick

19:15 <enebo[m]> then just hit muscle memory

19:15 <headius> it happens... just last week I could not get a debug breakpoint to fire because I was debugging the wrong codebase

19:15 <enebo[m]> yeah

19:16 <enebo[m]> This reminds me that in something else I was writing I needed to use the word content and all I could write is context

19:16 <enebo[m]> Every time I would try and write the word I would write context

19:16 <enebo[m]> ok well shallowDup fixes it :)

19:17 <headius> several of these WIP specs are basically things that always failed but now there's a new version of the spec for 3.x

19:18 <enebo[m]> yeah I have fixed a few where I know it was broken in 9.3 but it I also fixed 3-4 other problems and those were not broken on 9.3

19:18 <enebo[m]> So I should make smaller commits

19:24 <enebo[m]> headius: so why only windows?

19:26 <headius> it may affect unix too but the default filesystem, locale, and internal encodings are usually all UTF-8 so we don't see it

19:26 <enebo[m]> aha

19:26 <headius> I could not reproduce on unix but I'm not sure how to force the locale encoding to be something different

19:27 <enebo[m]> yeah you are right. This is a problem everywhere but it happens to only be windows which is not already UTF-8

19:28 <headius> I am quickly implementing Thread.ignore_deadlock to do nothing... should that be a verbose or non-verbose warning?

19:28 <enebo[m]> ah I see you answered this already. I was going to comment on this but you did it

19:28 <headius> it is a weird feature

19:29 <enebo[m]> hmm

19:29 <enebo[m]> headius: verbose obviously is a simple answer but it depends on how much someone depends on this feature to do something

19:29 <headius> I don't see a way to implement deadlock detection without overhauling locks and doing a lot of work at those boundaries

19:30 <headius> which is why JDK only does it when you pull a thread dump

19:30 <headius> our default is really "ignore" since we don't fast fail when you walk into a deadlock

19:30 <enebo[m]> look for "ignore_deadlock" on github

19:30 <enebo[m]> If a super common gem does it then people will complain

19:30 <headius> 3.0 feature, I doubt it is used in the wild much yet

19:31 <enebo[m]> I guess start with verbose and we can promote it if anyone ever notices

19:31 <enebo[m]> but if this does end up some common boilerplate in a library then people will complain that they endlessly see this warning on boot

19:32 <headius> nearly all uses are in tests for this feature

19:32 <headius> there are a couple sets in the wild but all seem to be setting to false

19:34 <headius> three instances in the wild that are not tests or specs for the feature

19:35 <headius> https://github.com/corytheboyd/simoneappolloni.com/blob/4493a55c4f6b728280898b043d98a662ed4eb960/app/runner.rb#L10

19:35 <headius> I will look up the feature and the justification...

19:36 <headius> TR no-ops as well

19:37 <headius> https://github.com/oracle/truffleruby/commit/3c5c4cd2cf1c7229591cad41f944465a878ca743

19:37 <headius> they don't warn

19:38 <headius> ffs

19:38 <enebo[m]> in that case

19:39 <headius> it looks like it was added because they have some edge cases where they get false positives, specifically surrounding signal handlers

19:39 <headius> which they run in the current thread still, I believe

19:39 <headius> https://bugs.ruby-lang.org/issues/13768#note-4

19:39 <headius> ioquatix disagrees and so do I but the discussion is otherwise all in Japanese

19:39 <headius> I

19:40 <enebo[m]> meh

19:40 <headius> I will just do it without a warning... should not have added this feature in the first place and instead fixed signal handling

19:40 <enebo[m]> yeah no warning seems right

19:40 <headius> how about notImplemented bit?

19:40 <headius> it would still be callable but if someone checked for it they'd know not to use it

19:41 <headius> we can pass specs with what I have, no warning and notImplemented = true

19:42 <enebo[m]> yeah I guess so

19:42 <enebo[m]> I don't know if anyone will do that but that is a valid strategy

19:44 <headius> can I impl it in Ruby so we don't add more useless methods to Java?

19:44 <headius> I'll push a PR

19:46 <headius> https://github.com/jruby/jruby/pull/7368

19:53 <headius> I'm going to run through and audit the 2.7, 3.0, and 3.1 feature list issues

19:54 <enebo[m]> ok

19:54 <headius> if that PR looks ok I will merge

19:54 <enebo[m]> yeah at some level I wonder about load time with lots of Ruby but that is our long term goal

19:55 <enebo[m]> if this ends up stacking up we might want to dump a single file of all core_ext or something like that

19:55 <headius> it is doable and not too hard, we just haven't done it

19:56 <headius> smash into a single file during build or even precompile

19:56 <headius> I appreciate the concern but it is a general thing we need to fix so I feel like adding one more is not a huge concern right now

20:15 <enebo[m]> oh I agree...just bringing it up

20:31 <headius> enebo: https://bugs.ruby-lang.org/issues/17273

20:31 <headius> this is some new comment pragma that I guess freezes constant values at assignment

20:31 <enebo[m]> ah I have not looked at it at all

20:31 <headius> ok

20:32 <enebo[m]> adding the pragma should be really easy but I suppose we need to mark those nodes

20:32 <headius> seems like it is driven by Ractor interest so I'm not sure how crucial i tis

20:32 <enebo[m]> I figured it was only for ractor but you are right...someone may use it anyways

20:33 <enebo[m]> Actually having seen the examples I am not sure anyone would use it

20:36 <headius> yeah it is weird

20:36 <headius> moving on... I will keep auditing and impl anything really small, and then circle back to the key features we are missing

20:37 <headius> we are close

20:37 <enebo[m]> we need hash identhash impls

20:38 <enebo[m]> TestHash#test_slice_on_identhash [/home/runner/work/jruby/jruby/test/mri/ruby/test_hash.rb:1168]:

20:38 <enebo[m]> <{"str"=>1, "str"=>2}> expected but was

20:38 <enebo[m]> <{"str"=>2}>.

20:38 <enebo[m]> I believe this is used in Rails now

20:39 * headius sent a code block: https://libera.ems.host/_matrix/media/r0/download/libera.chat/9846e8b4cd765bc0bd4905c9f5a1c18f3a21ec57

20:39 <headius> working but a little weird on that output

20:41 <headius> https://github.com/jruby/jruby/issues/7370

20:41 <headius> I will call the feature done and filed this to track the weirdness

20:43 <enebo[m]> I just fixed something on 9.3 which was double warning

20:46 <enebo[m]> I did merge that so it maybe is a similar thing

20:46 <enebo[m]> quick check

20:47 <headius> ok

20:51 <headius> enebo: you implemented that Kernel#clone freezing stuff yeah?

20:53 <enebo[m]> HAHAHAHAHA

20:53 <enebo[m]> headius: yeah

20:53 <enebo[m]> so this is double printing

20:53 <headius> ok

20:53 <enebo[m]> because we are compiling the main script and failing

20:54 <enebo[m]> Or it looks that way. First one is tryCompile and second one is interpreter

20:54 <headius> ahhh ok

20:55 <headius> so it treats the syntax error in compile or interpret and falls back

20:55 <headius> this must be a parser error in CRuby though yeah?

20:55 <enebo[m]> I am tracing this back

20:55 <headius> yield in any class or sclass body should syntax error out

20:56 <enebo[m]> ah yeah precompileCLI will catch any Exception

20:57 <headius> it should propagate RaiseException

20:57 <headius> probably

20:57 <headius> but still think this should happen in parser

20:57 <enebo[m]> well in this case for sure

20:57 <enebo[m]> It happens as a check in IRBuilder

20:57 <enebo[m]> but it could happen in parser although I think this happens in compile.c in MRI too

20:58 <enebo[m]> there are at least a dozen syntaxerror in irbuilder

20:59 <headius> yeah that would be fine too then

20:59 <headius> so it just needs to propagate it

20:59 <enebo[m]> but is there a reason why RaiseException would still work if then interpreted

20:59 <headius> not that I can think of

21:00 <headius> that is only raised for ruby reasons

21:00 <enebo[m]> I can't but I don't trust my mind today :)

21:00 <headius> compile failure in JIT should always be NotCompileableException

21:00 <enebo[m]> we can just never rewrite the JIT in Ruby now :P

21:00 <headius> and I guess I do other Exception in case of compiler bugs

21:00 <headius> NPE or whatever

21:00 <enebo[m]> but yeah we have to use the right exception in JIT

21:01 <headius> I will try a fix

21:01 <enebo[m]> I just added an empty raiseexception catch

21:01 <enebo[m]> compiling

21:02 <enebo[m]> hmm

21:03 <headius> you can catch SyntaxError

21:03 <headius> if you want to keep it narrow

21:04 <enebo[m]> I don't if it doesn't need to be

21:04 <enebo[m]> If for example we live create a Ruby instance in IR builder at some point and it is invalid it can be almost any kind of raise exception

21:04 <enebo[m]> it will not work when interpreted either

21:05 <enebo[m]> so maybe less code is better and broader will reduce doing some stuff twice

21:05 <enebo[m]> diff --git a/core/src/main/java/org/jruby/Ruby.java b/core/src/main/java/org/jruby/Ruby.java... (full message at <https://libera.ems.host/_matrix/media/r0/download/libera.chat/d84881b362d78fc8b25a445968c3e47f323bc30c>)

21:06 <enebo[m]> ARRG :)

21:06 <enebo[m]> anyways this works for the double stuff

21:06 <enebo[m]> I will throw it into a PR

21:06 <headius> yeah fix runWithGetsLoop too because that does the compile as well

21:06 <headius> it probably shouldn't duplicate this logic but it does

21:06 <enebo[m]> ok

21:07 <headius> and it catches Throwable... yuck

21:07 <enebo[m]> haha I was just going to point that out

21:38 <headius> so I am done for today.. 3.0 remaining issues are almost all related to scheduler or ractor, with a few little things here and there and the big one being the prepend/include module stuff

21:39 <headius> I suspect we are down to that and a couple other key items and in good shape

21:41 <enebo[m]> refinements issue somewhere which could even be the include/prepend thing

21:41 <headius> Yeah so that and greening up those library suites and then we can do a review of what remains failing

21:42 <headius> I feel like we got several folks already running off of snapshots