#pypy on 2022-11-01 — irc logs at libera.irclog.whitequark.org

2022-04-07 20:04 cfbolz changed the topic of #pypy to: #pypy PyPy, the flexible snake https://pypy.org | IRC logs: https://quodlibet.duckdns.org/irc/pypy/latest.log.html#irc-end and https://libera.irclog.whitequark.org/pypy | Matti: I made a bit of progress, the tests now only segfault towards the end

00:55 jinsun has quit [Read error: Connection reset by peer]

01:53 epony has quit [Ping timeout: 252 seconds]

01:55 epony has joined #pypy

03:57 epony has quit [Ping timeout: 252 seconds]

03:59 epony has joined #pypy

04:04 jcea has quit [Ping timeout: 246 seconds]

04:11 EWDurbin has quit [Read error: Connection reset by peer]

04:11 graingert has quit [Read error: Connection reset by peer]

04:11 EWDurbin has joined #pypy

04:12 graingert has joined #pypy

04:13 jean-paul[m] has quit [Ping timeout: 246 seconds]

04:13 jevinskie[m] has quit [Ping timeout: 246 seconds]

04:14 Atque has joined #pypy

04:31 jean-paul[m] has joined #pypy

04:33 jevinskie[m] has joined #pypy

06:02 epony has quit [Ping timeout: 252 seconds]

06:05 epony has joined #pypy

06:08 chromebittin has joined #pypy

06:32 chromebittin has joined #pypy

06:32 chromebittin has quit [Changing host]

06:40 chromebittin has left #pypy [#pypy]

06:40 otisolsen70 has joined #pypy

06:41 otisolsen70 has quit [Remote host closed the connection]

06:42 otisolsen70 has joined #pypy

06:49 epony has quit [Quit: QUIT]

07:03 epony has joined #pypy

07:09 epony has quit [Remote host closed the connection]

07:10 epony has joined #pypy

07:15 dmalcolm has joined #pypy

07:16 dmalcolm__ has quit [Ping timeout: 248 seconds]

08:07 otisolsen70 has quit [Quit: Leaving]

08:38 otisolsen70 has joined #pypy

08:39 otisolsen70 has quit [Remote host closed the connection]

08:40 otisolsen70 has joined #pypy

08:43 <antocuni> https://twitter.com/pyblogsal/status/1587146448503808006

08:43 <antocuni> CPython 3.12 will support perf

08:44 <antocuni> and you'll get flamegraphs with Python + C frames

08:46 <cfbolz> Yeah, they use jit maps

08:47 <antocuni> yes but how do they distinguish the various calls to PyEval_EvalFrame? Do they generate a small trampoline for every code object?

08:47 <antocuni> ah, it seems so, it's briefly explained here: https://docs.python.org/pl/dev/howto/perf_profiling.html

08:49 <cfbolz> antocuni: did you find the code yet? Does it mean they have a tiny jit? ;-)

08:50 <antocuni> according to this tweet, yes: https://twitter.com/pyblogsal/status/1587178352976269313

08:50 <antocuni> I remember I wanted to try something like that for vmprof, but I think I never did

08:51 <antocuni> in particular, I think that a JIT is not necessarily needed: in theory, all these trampolines are the same, so you could just compile a "template" C trampoline and memcpy it around, I *think*

08:53 <cfbolz> antocuni: yes but that depends on a huge amount of details too

08:54 <cfbolz> antocuni: why would that help for vmprof, btw?

08:54 <antocuni> yes; for example, it's easy to find the start of a function, but it's hard to find its end :)

08:56 <antocuni> cfbolz: the original implementation of vmprof used a hack to determine the python code object from the C stack frame: IIRC, the first argument of PyEval_EvalFrame was a PyFrameObject*, so we walked it until we found the name of the python function

08:56 <cfbolz> antocuni: ah, for the cpython version

08:56 <antocuni> like, doing f->f_code->co_name or something along those lines

08:56 <antocuni> yes

08:57 <antocuni> but if you have a trampoline, you can basically kill vmprof and use perf :). As they did now

08:57 <antocuni> I guess that at the time having a JIT inside CPython was considered a heresy. Things change :)
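
The frame walk described above for the original vmprof (following `f->f_code->co_name` from a `PyFrameObject*`) has a rough Python-level analogue. The sketch below is not vmprof's code; it only shows the same chain of fields from inside the interpreter, using the CPython-specific `sys._getframe`. The helper names `python_stack_names`, `inner` and `outer` are made up for illustration.

```python
import sys


def python_stack_names():
    """Return function names from the caller's frame outwards."""
    names = []
    frame = sys._getframe(1)  # skip this helper, start at the caller
    while frame is not None:
        # Python-level view of the C expression f->f_code->co_name
        names.append(frame.f_code.co_name)
        frame = frame.f_back  # move to the calling frame
    return names


def inner():
    return python_stack_names()


def outer():
    return inner()


names = outer()
print(names[:2])
```

A sampling profiler has to do this from outside the interpreter, on a C stack, which is why it needed the first argument of `PyEval_EvalFrame` in the first place.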

08:58 <cfbolz> Perf stack maps are quite a hack, fwiw

08:58 <cfbolz> They aren't easy to use for 'proper' jits

08:59 <antocuni> found the code, btw: https://github.com/python/cpython/blob/0e15c31c7e9907fdbe38a3f419b669fed5bb3b33/Python/perf_trampoline.c#L1

09:08 <antocuni> ah, according to this comment they do exactly what I described above: Notice that for this to work, there must be a unique copy of the trampoline

09:08 <antocuni> per Python code object even if the code in the trampoline is the same. To

09:08 <antocuni> achieve this we have an assembly template in Objects/asm_trampiline.S that is

09:08 <antocuni> compiled into the Python executable/shared library. This template generates a

09:08 <antocuni> symbol that maps the start of the assembly code and another that marks the end

09:08 <antocuni> of the assembly code for the trampoline. Then, every time we need a unique

09:08 <antocuni> trampoline for a Python code object, we copy the assembly code into a mmaped

09:08 <antocuni> area that has executable permissions and we return the start of that area as

09:08 <antocuni> our trampoline function.

09:08 <antocuni> oops, sorry, I wanted to copy the link and not the full text :(

09:09 <antocuni> with the difference that they write the trampoline in assembler instead of C

09:11 <antocuni> ah, I see. By writing it in assembler they can put a global symbol which marks the *end* of the function, which was exactly the problem which I stumbled upon too
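
The "copy one template per code object" idea from the quoted comment can be modelled without touching executable memory. The sketch below is a simulation only: `TEMPLATE` is placeholder bytes rather than real machine code, a `bytearray` stands in for the mmaped arena, and `TrampolineArena` and `trampoline_for` are hypothetical names, not CPython's. It shows that every code object gets its own address range even though the copied bytes are identical.

```python
# Placeholder bytes standing in for the compiled assembly template.
# In the real scheme the length comes from the start and end symbols.
TEMPLATE = bytes([0x90] * 16)


class TrampolineArena:
    """Hands out one copy of TEMPLATE per code object (simulation)."""

    def __init__(self, size=4096):
        self.memory = bytearray(size)  # stand-in for an executable mmap
        self.used = 0
        self.by_code = {}  # code object -> (offset, length)

    def trampoline_for(self, code):
        if code in self.by_code:
            return self.by_code[code]
        length = len(TEMPLATE)
        if self.used + length > len(self.memory):
            raise MemoryError("arena full")
        start = self.used
        # The memcpy step: same bytes, new location.
        self.memory[start:start + length] = TEMPLATE
        self.used += length
        self.by_code[code] = (start, length)
        return start, length


def f():
    return 1


def g():
    return 2


arena = TrampolineArena()
print(arena.trampoline_for(f.__code__))
print(arena.trampoline_for(g.__code__))
```

Nothing is ever freed here, which matches the arena-style allocation discussed later in the log.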

09:16 dmalcolm_ has joined #pypy

09:18 <cfbolz> antocuni: you can get that in C too with some tricks

09:18 <cfbolz> But yes, all somewhat messy

09:18 <cfbolz> antocuni: I suppose the trampolines leak in cpython?

09:19 dmalcolm has quit [Ping timeout: 252 seconds]

09:27 <cfbolz> antocuni: I think they have to, perf jit maps don't support reusing an address for a different function later

09:31 <antocuni> I suppose that in theory they could free trampolines when they are not needed and try to remember the address to avoid reusing it. But they are allocated in big arena blocks, so in practice I guess they will always survive and leak (but didn't look at the actual implementation)

09:35 <cfbolz> It's a tiny amount of bytes only of course
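
For reference, a perf map file is plain text with one `START SIZE name` line per symbol, where the two numbers are hexadecimal. The sketch below formats such lines and, following cfbolz's point above, refuses to publish an address range a second time. `PerfMapWriter` is a hypothetical helper, the symbol names are only illustrative, and nothing is written to disk.

```python
class PerfMapWriter:
    """Collects perf map lines and rejects reuse of a published range."""

    def __init__(self):
        self.lines = []
        self.ranges = []  # (start, end) pairs already handed to perf

    def add(self, start, size, name):
        end = start + size
        for old_start, old_end in self.ranges:
            # Any overlap would make the address ambiguous for perf.
            if start < old_end and old_start < end:
                raise ValueError("address range already published")
        self.ranges.append((start, end))
        # Hex start, hex size, symbol name; no 0x prefix.
        self.lines.append("%x %x %s\n" % (start, size, name))


writer = PerfMapWriter()
writer.add(0x7F00, 0x10, "py::f:/tmp/a.py")
writer.add(0x7F10, 0x10, "py::g:/tmp/a.py")
print("".join(writer.lines), end="")
```

Under this constraint, keeping every trampoline alive is the simplest safe policy, at the cost of the small leak mentioned above.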

13:00 derpydoo has joined #pypy

13:16 jcea has joined #pypy

16:17 derpydoo has quit [Ping timeout: 268 seconds]

16:40 glyph has quit [Quit: End of line.]

16:41 glyph has joined #pypy

16:41 [m]alice is now known as alice

16:51 <cfbolz> mattip: wow, bpnn.py

16:51 <cfbolz> That takes me back

16:52 <cfbolz> That was really before the AI resurgence :-)

16:53 <mattip> someone on stack overflow asked about python ./rpython/translator/goal/translate.py program.py

16:53 <mattip> which got me to try to find where in the documentation we mention that

16:53 <mattip> bpnn was the only place I found

16:54 <mattip> and it isn't even part of the html documentation

16:54 <mattip> https://stackoverflow.com/questions/74265676/pypy-rpython-and-python-versions-compatibility-when-translation-process
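
As a rough illustration of what `program.py` in that command line has to look like: the translator expects a module-level `target` function that returns the entry point. The sketch below is a minimal example under that convention, not the content of bpnn.py. The RPython toolchain itself runs under Python 2.7, so the file avoids syntax that differs between Python 2 and 3.

```python
import os


def entry_point(argv):
    # argv is the list of command-line strings of the translated binary.
    os.write(1, b"hello from a translated program\n")
    return 0  # becomes the process exit code


def target(*args):
    # Called by translate.py; returns the entry point and no extra data.
    return entry_point, None
```

The same file can still be imported and exercised untranslated, which is handy for checking the logic before starting a translation.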

17:27 derpydoo has joined #pypy

17:37 derpydoo has quit [Ping timeout: 255 seconds]

17:39 derpydoo has joined #pypy

18:21 antocuni[m] has joined #pypy

18:44 glyph_ has joined #pypy

18:45 glyph has quit [Ping timeout: 252 seconds]

18:45 glyph_ is now known as glyph

20:17 otisolsen70_ has joined #pypy

20:18 otisolsen70_ has quit [Remote host closed the connection]

20:20 otisolsen70 has quit [Ping timeout: 272 seconds]

20:22 derpydoo has quit [Ping timeout: 255 seconds]

20:44 derpydoo has joined #pypy

21:03 epony has quit [Quit: QUIT]

21:05 epony has joined #pypy

22:08 dustinm has quit [Quit: Leaving]

22:17 dustinm has joined #pypy

23:45 jinsun has joined #pypy

23:58 derpydoo has quit [Ping timeout: 255 seconds]