#hpy on 2021-06-21 — irc logs at libera.irclog.whitequark.org

2021-05-27 19:57 antocuni changed the topic of #hpy to: https://hpyproject.org - https://github.com/hpyproject/hpy - IRC logs: https://libera.irclog.whitequark.org/hpy

09:53 <antocuni> ronan: I've tried to compile pypy in the hpy-0.0.2 branch. When running extra_tests/hpy_tests I get random segfaults/aborts/bus errors etc. so I think there is some memory corruption going on

09:53 <antocuni> and the applevel tests all pass

09:53 <antocuni> are you aware of it? Did you investigate already and/or have clues?

11:46 <Hodgestar> antocuni: Re HPyUnicodeBuilder_XXX: Perhaps instead of having many HPyUnicode builders, the HPyBytesBuilder can have a "asStr" or "asUnicode" method that takes one of ASCII, UTF8, UCS{1,2,4} as the encoding and returns a handle to a Python unicode/str object?

13:21 <antocuni> Hodgestar: I'm not sure to understand what you mean. Could you please provide a code example?

13:25 <ronan> antocuni: no, I wasn't aware of the issue

13:26 <antocuni> I'm trying to bisect to find when the bug was introduced, but it looks like extra_tests have not passed for a while

13:29 <antocuni> ronan: e.g., this is what I get with revision 510a797a9fe2 (which was the last changed made by me): http://paste.openstack.org/show/806824/

13:30 <antocuni> however, I don't understand whether it's the same problem as on HEAD or it's different. The traceback look different

13:30 <antocuni> also, the traceback seems weird: it looks like we care calling ctx_HPy_CallTupleDict from HandleManager.close()?

13:40 <ronan> antocuni: I don't remember seeing that sort of failure

13:42 <ronan> and the extra_tests run normally for me with a build from last week (70d6b75c11c7)

13:48 <antocuni> ah, interesting

13:48 <antocuni> let me try that

13:49 <antocuni> this probably means that we should run extra tests nightly

13:49 <antocuni> also, FWIW, I'm translating with -O2

14:05 <antocuni> ronan: nope, I get failures also with 70d6b75c11c7 :( http://paste.openstack.org/show/806826/

14:06 <antocuni> do you have a binary around?

14:07 <ronan> antocuni: yes, in the nightlies

14:07 * antocuni tries

14:14 <antocuni> ronan: if I try to run extra_tests with 70d6b75c11c7 I also get this pytest error: http://paste.openstack.org/show/806828/

14:14 <antocuni> do you confirm you also had to fix it/comment it out?

14:16 <ronan> antocuni: yes, I have that issue if I use pytest 2.9.2, but not with a more recent version

14:17 <antocuni> ah ok. I always use the pytest version which is bundled with pypy, didn't know we can also use a more recent one

14:22 <antocuni> so I confirm that the nightly build passes, but my -O2 custom build aborts

14:22 * antocuni tries to build with -Ojit

14:33 <Hodgestar> antocuni: I can try write a code example. My bigger picture idea though is to avoid having a complex HPyUnicodeBuilder with different builders for different character layouts and to rather use the HPyBytesBuilder and then have a function that hands that buffer to the Python implementation behind the scenes and lets it do the best it can with the format. I think in practice it's not that different to the HPyUnicodeBuilder_XXX, but I'm trying to not

14:33 <Hodgestar> have the API suggest that HPyUnicodeBuilder_UCS4 can be expected to behave as nicely as HPyUnicodeBuilder_UTF8, or that there is really equal support for all five character encoding formats.

14:33 <antocuni> yes, but I'm trying to imagine how such an API would look like

14:34 <antocuni> e.g., we can call HPyUnicodeBuilder_UCS4(size, &buffer), where &buffer is a HPy_UCS4**

14:35 <antocuni> but if we don't distinguish between kinds, what is the type of the buffer?

14:36 <antocuni> And similarly for "size": it represent the number of characters, but the size in bytes depends on the encoding

14:47 <antocuni> so, it seems that we get the RPython abort only with -O2. PyPys translated with -Ojit can run extra_tests smoothly

14:47 <antocuni> even with --jit off

14:47 <Hodgestar> I'm wondering if it is possible to write a streaming encoder that holds only a few bytes of extra storage and converts to whatever encoding the underlying Python likes most?

14:47 ronan has quit [Ping timeout: 268 seconds]

14:48 <antocuni> I wonder how serious is the bug. Is it a real undefined behavior which doesn't manifest with -Ojit just by chance? Or maybe it's a smaller issue which manifests only with the set of options enabled by -O2 and thus will never affect JIT builds?

14:48 ronan has joined #hpy

14:48 <antocuni> Hodgestar: what is a streaming encoder?

14:50 <Hodgestar> antocuni: One feeds in N bytes at a time and the encoder spits out M bytes of output. It's complicated though because we're trying to avoid slow copies. :/

14:50 <antocuni> maybe, but do you have any concrete use case in which it would be useful?

14:57 ronan has quit [Ping timeout: 244 seconds]

14:59 ronan has joined #hpy

15:01 <antocuni> I'm confused. I seemed to remember that -O2 and -Ojit builds differed in more that just the JTT (i.e. the enabled/disabled a slightly different set of objspace options), but it seems that nowadays they are the very same build

15:02 <antocuni> cfbolz: do you confirm? ^^^

15:04 <cfbolz> antocuni: yeah, we tried to unify that a lot

15:04 <cfbolz> Since quite a while actually

15:05 <antocuni> so basically, running pypy --jit off is essentially the same as running a -O2 pypy?

15:05 <cfbolz> antocuni: 'almost'

15:05 <cfbolz> But yes, quite close

15:06 <antocuni> then I have no idea why this bug manifests in a very reproducible way with -O2 but never with -Ojit :(

15:19 <antocuni> ronan: so, assuming that we fix this -O2 but with extra_tests, what is left to do before being able to release hpy-0.0.2?

15:20 <ronan> I think it's ready now

15:21 <antocuni> cool

15:21 <antocuni> but I think I'd like to investigate a bit this problem before releasing

15:22 <antocuni> we also need to make sure that our local copy of hpy is in sync with the git branch release/0.0.2

15:22 <antocuni> fangerer: what is the GraalPython status w.r.t 0.0.2?

15:23 <ronan> antocuni: I think it is

15:23 <antocuni> 🎉

15:23 <antocuni> thanks

16:23 <cfbolz> antocuni: did you try lldebug btw?

16:23 <antocuni> yes; -Ojit lldebug works, -O2 lldebug fails

16:23 <cfbolz> But still with segfault?

16:24 <antocuni> no, RPython AssertionError

16:24 <antocuni> but it's weird. With HEAD, I get different errors (including segfaults) at each run

16:25 <antocuni> with 70d6b75c11c7, I get the same error consistently

16:25 <antocuni> so now I'm not even sure they are the same problem, or two different ones

16:28 <antocuni> ok, nevermind. Now I get some segfaults even with -O2 lldebug

16:34 <antocuni> yes, I confirm I get the weirdest errors, like this: http://paste.openstack.org/show/806834/

16:35 <antocuni> if I comment out test_foo, the test consistently pass. If I include test_foo, I consistently get this error

16:37 <antocuni> uhm wait, this is a FatalError which is raised by the test itself

16:47 <antocuni> ok, I think that what happens is that handles are mapped to "random" W_Objects (or maybe even to random addresses in memory), so all kind of confusion occurs

21:56 mattip has quit [Ping timeout: 272 seconds]

22:11 mattip has joined #hpy