cfbolz changed the topic of #pypy to: PyPy, the flexible snake (IRC logs: https://botbot.me/freenode/pypy/ ) | use cffi for calling C | the secret reason for us trying to get PyPy users: to test the JIT well enough that we're somewhat confident about it
<R3d_Sky>
I've been testing out a silly function as a microbenchmark on PyPy, and it appears to be 100x slower than the same test run on CPython with perf
<R3d_Sky>
is pypy just not very good at handling small functions run via perf? because when I add some extra code to make the function actually do stuff, pypy beats cpython by 50% in my perf tests
adamholmberg has quit [Ping timeout: 240 seconds]
<simpson>
PyPy's designed for real workloads rather than microbenchmarks.
inad922 has quit [Ping timeout: 240 seconds]
<mattip>
lesshaste: you might want to take a look at cppyy
<mattip>
if you want to use cffi, you will have to create wrappers for class methods, and pass a void* for the class instance as an argument
<mattip>
and there's no dynamic dispatch: you need a separate wrapped function for each specialization
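A minimal sketch of the wrapper approach described above, using cffi's API mode and compiling the generated source as C++; the Counter class, the wrapper functions, and the _counter_cffi module name are all hypothetical:

```python
# build_counter.py -- hypothetical example of wrapping a C++ class for cffi:
# every method gets a plain-C wrapper that takes the instance as a void*.
import cffi

ffibuilder = cffi.FFI()

# The only things cffi sees: free functions operating on an opaque handle.
ffibuilder.cdef("""
    void *counter_new(int start);
    void counter_delete(void *self);
    int counter_increment(void *self);
""")

# The wrappers themselves, compiled as C++ (hence source_extension=".cpp").
ffibuilder.set_source("_counter_cffi", r"""
    class Counter {
        int value;
    public:
        Counter(int start) : value(start) {}
        int increment() { return ++value; }
    };

    extern "C" {
        void *counter_new(int start)        { return new Counter(start); }
        void  counter_delete(void *self)    { delete static_cast<Counter *>(self); }
        int   counter_increment(void *self) { return static_cast<Counter *>(self)->increment(); }
    }
""", source_extension=".cpp")

if __name__ == "__main__":
    ffibuilder.compile(verbose=True)
```

On the Python side, lib.counter_new(...) returns the opaque void * handle that gets passed back into each wrapper; every C++ specialization needs its own set of such wrappers, since there is no dynamic dispatch across the C boundary.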
<the_drow>
Hi guys, I tested PyPy3.5-6.0.0 with the new cpyext improvements. It seems that python-rapidjson does not show an improvement compared to 3.5-5.10.1. I'll write a blog post about this later, so I want to find out exactly why that's not the case.
<the_drow>
If anyone wants to help me figure this out, I'd love some help. And you'll get the credit of course.
<antocuni>
the_drow: it's entirely possible that certain libs do not show any cpyext speedup
<the_drow>
I'm fully aware
<the_drow>
But I want to provide others with a sense of what these speedups do and what they don't
<antocuni>
the CPython C API is huge, and consists of hundreds of functions/macros; what we basically did was to find a general way to make them faster, and to implement this speedup for a handful of cases
<the_drow>
The only thing that this speedup improves is calling Python functions written in C, right?
<antocuni>
yes, the cost of function (and method) calls is vastly reduced
<antocuni>
however, once you are in C, if you do tons of API calls, it's possible that you spend a lot of time there
<antocuni>
it's a bit hard to say; what we usually do is to run a benchmark (or, even better, a microbenchmark) using callgrind, and see where we spend most of the time
the_drow_ has joined #pypy
<the_drow_>
antocuni, sorry, I was disconnected. python-rapidjson spends most of its time building Python objects
<antocuni>
yes exactly; this is one of the things that are still not fully optimized
<the_drow_>
That's probably why the speedup is not significant in this case
<the_drow_>
I wish PyPy had used RapidJSON directly.
<antocuni>
yes, I guess so
<the_drow_>
Instead of using a custom implementation
<antocuni>
I suppose we are willing to consider a PR :)
<the_drow_>
But that would be very hard to do with RPython, and it would introduce C++ as a dependency
<the_drow_>
I thought of that
<the_drow_>
But I'm not sure adding C++ as a build dependency is a good idea
<antocuni>
probably not
the_drow has quit [Ping timeout: 240 seconds]
<antocuni>
also, what is the performance of rapidjson compared to e.g. ujson?
<the_drow_>
much much faster
<the_drow_>
I'll send you a link to the benchmark
<the_drow_>
and then you have the problem that you can't use CFFI, because that requires hopping between Python and C++.
<antocuni>
you can try cppyy
<the_drow_>
cppyy doesn't work in this case because rapidjson is too modern
<the_drow_>
I was just saying ;)
<antocuni>
ah
<the_drow_>
antocuni, this was the trigger to tell Wim to upgrade cling and do all that refactoring he did
<antocuni>
anyway, I doubt that cpyext will ever be fast enough to make python-rapidjson faster than an RPython or RPython+C implementation
<the_drow_>
Is PyPy's build system ready for such a thing?
<antocuni>
yes, you can select whether or not to compile a module when translating
<the_drow_>
how would such an implementation look in RPython?
<antocuni>
the hard part is that we don't have a good way to call c++ from RPython
<antocuni>
so you probably need to expose a C API first
<antocuni>
and then call this C API with rffi
<the_drow_>
antocuni, I tried using CFFI, but because the GIL is released all the time, that doesn't work
<antocuni>
rpython uses rffi, which is conceptually similar to cffi but implemented very differently
<the_drow_>
There's an interface for RapidJSON that calls back when it encounters an object, a number, a string etc.
<the_drow_>
So if I implement something with cffi I can port it pretty easily?
<antocuni>
look e.g. at the implementation of pypy/module/_ssl
<antocuni>
it wraps openssl
<antocuni>
the rffi bindings are inside rpython.rlib.ropenssl
<antocuni>
the_drow_: you are getting confused
<antocuni>
you can write a cffi module if you want
<antocuni>
this module would work on cpython and pypy, you can put it on pypi and install it with pip
<the_drow_>
antocuni, but that doesn't work, because CFFI releases the GIL and we hop back and forth between Python and C++ a lot
<the_drow_>
antocuni, I'm fully aware of that fact.
<antocuni>
ok, then forget about cffi
<antocuni>
if you want to write an rpython module, then it's pypy-only, needs to live inside pypy/module and can be compiled only when you translate the full pypy
<the_drow_>
So far so good
<antocuni>
for calling C inside rpython, you use rffi
<the_drow_>
good
<the_drow_>
And it's the same API as CFFI?
<antocuni>
no
<antocuni>
look at _ssl
asmeurer_ has quit [Quit: asmeurer_]
<the_drow_>
How is libssl_SSL_new defined, for example?
<the_drow_>
and can I include some C code for the wrapper?
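For reference, the ropenssl bindings mentioned above boil down to rffi.llexternal declarations plus an ExternalCompilationInfo that names the headers and libraries. The following is a simplified sketch in that style, not the literal PyPy source:

```python
# Simplified sketch in the style of rpython.rlib.ropenssl; details differ
# from the real file.
from rpython.rtyper.lltypesystem import rffi
from rpython.translator.tool.cbuild import ExternalCompilationInfo

eci = ExternalCompilationInfo(
    includes=['openssl/ssl.h'],
    libraries=['ssl', 'crypto'],
)

# Opaque pointer types for C structs that RPython never looks inside.
SSL_CTX = rffi.COpaquePtr('SSL_CTX', compilation_info=eci)
SSL = rffi.COpaquePtr('SSL', compilation_info=eci)

# The binding itself: C name, argument types, result type, and the ECI that
# tells the C compiler where the declaration comes from.
libssl_SSL_new = rffi.llexternal('SSL_new', [SSL_CTX], SSL,
                                 compilation_info=eci)
```

Custom C snippets can also be fed in through ExternalCompilationInfo (for example via its separate_module_sources argument), which comes up again further down.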
antocuni has quit [Ping timeout: 276 seconds]
<the_drow_>
arigato, ping?
<the_drow_>
I actually think there's an easier patch here
<the_drow_>
But I need to use SIMD from RPython if possible
<arigato>
the_drow_: we recommend against using rffi. better to try cffi and complain if you have specific performance benchmarks
<arigato>
e.g. yes, it releases the gil and reacquires it all the time; that's exactly why this has been heavily optimized already, and it shouldn't cost a lot now
amaury has quit [Remote host closed the connection]
<arigato>
and yes, cffi is not really meant to work with a huge number of Python objects handled by the C side, but you can often reorganize things
<the_drow_>
arigato, we already tried CFFI with hiredis (remember that one?) and rapidjson. It is impossible to reach CPython's level of performance with CFFI for parsers.
<the_drow_>
arigato, Maybe if CFFI had a way to avoid releasing the GIL, we could have made it work...
<the_drow_>
arigato, I think the main problem is not the GIL but the fact that we have to use callbacks.
<arigato>
cffi and rffi are very, very different beasts for a similar purpose
<arigato>
rffi is only available when you translate a new pypy from scratch
<arigato>
while it is certainly possible to make something in rffi, it requires rpython knowledge and it won't be available in standard pypy builds
<arigato>
here's one way I'd consider trying, using cffi; it should work on pypy or cpython, definitely slow on cpython but really fast on pypy:
<arigato>
you write C code whose purpose is to parse the text and write a "parsed representation" in a big buffer
<arigato>
the parsed representation is basically defined with a bunch of C structures, repeated as needed, with maybe some pointers or maybe not (i.e. all inline as binary data)
<arigato>
so once the text is parsed, you have this big buffer (only one call to C); then the Python side can read it using ffi pointers
<arigato>
the trick here is that on the Python side, you write classes that are implemented with just one pointer each, and that lazily build more such classes when you access more items
<arigato>
so for example, say you want to parse some text that is a list of 2d points, "[(x=5.6, y=7), (x=7, y=-2)]"
<arigato>
you do that with C code that emits a big array of "struct point { float x, y; }", say
<arigato>
then on the Python side, you make two classes: ListOfPoints and Point
<arigato>
ListOfPoints has got a __getitem__ which checks the bounds and returns a fresh Point
<arigato>
the Point class is simple enough that __getitem__ could just read the x and y values from the C data and stick them on the Point instance
<arigato>
if Point were more complicated, __getitem__ would compute a ffi pointer to the item, and pass that to the Point instance
<arigato>
and then the Point instance itself would have properties x and y that would read the x and y values from the ffi pointer
<arigato>
you do that without caching the Point instances
<arigato>
it looks indirect, but PyPy is very good at optimizing that; it will make Point instances temporarily and free them as soon as they are no longer needed, or the JIT can completely remove the creation of the Point instances if it sees that they don't survive for long
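A minimal sketch of the Python half of this scheme, assuming a hypothetical out-of-line cffi module _points whose C side defines struct point { float x, y; } and a parse_points() function that fills a buffer of them (both names are invented for the example):

```python
# Hypothetical sketch: lazy wrapper classes over a cffi buffer filled by C.
from _points import ffi, lib   # invented cffi module; its C side defines
                               # "struct point { float x, y; };" and a parser

class ListOfPoints(object):
    def __init__(self, buf, length):
        self._buf = buf          # cdata "struct point[]"; kept alive here
        self._length = length

    def __len__(self):
        return self._length

    def __getitem__(self, index):
        if not 0 <= index < self._length:
            raise IndexError(index)
        # A fresh, uncached Point per access: cheap on PyPy, where the JIT
        # can remove the allocation entirely if the instance does not escape.
        return Point(self._buf + index)

class Point(object):
    def __init__(self, ptr):
        self._ptr = ptr          # ffi pointer to one "struct point"

    @property
    def x(self):
        return self._ptr.x

    @property
    def y(self):
        return self._ptr.y

def parse(text, max_points=4096):
    # Single call into C: the hypothetical lib.parse_points fills the buffer
    # and returns how many points it wrote.
    buf = ffi.new("struct point[]", max_points)
    n = lib.parse_points(text.encode('utf-8'), buf, max_points)
    return ListOfPoints(buf, n)
```

The Point instances are deliberately not cached, as described above; the same code runs on CPython too, it just pays the full allocation cost there.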
<the_drow_>
arigato, What I want to do is to port the SIMD code that skips whitespaces from RapidJSON to PyPy
<the_drow_>
There's no need to actually bind RapidJSON into PyPy
<arigato>
ok, then I completely misunderstood you, sorry :-)
<the_drow_>
I only wanted RapidJSON because the code is already there
<arigato>
maybe look at rpython.rlib.longlong2float.uint2singlefloat for an example
<the_drow_>
However, there are a few things here I need to figure out: 1) I have not seen a single line of C code in the PyPy code base, so I'm wondering if I should write the headers in C and use them with rffi. 2) I'm not sure if we really need C code. If we could simply import the right SIMD headers and use rffi to call the intrinsics, that would be better, no?
<the_drow_>
Oh I can use C code like that
<the_drow_>
Ok
<the_drow_>
Got it
<the_drow_>
arigato, is there a ptrdiff_t in rlib?
<the_drow_>
RapidJSON uses pointer arithmetic while we use an index
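A hedged sketch of how the rffi side of that idea could look, with a plain scalar C helper standing in for RapidJSON's SIMD whitespace skipper; the helper name, its signature, and the use of indices instead of pointer arithmetic (which sidesteps the ptrdiff_t question) are all choices made up for this example:

```python
# Hypothetical sketch, not actual PyPy code: exposing a small C helper to
# RPython via rffi.llexternal and ExternalCompilationInfo.
from rpython.rtyper.lltypesystem import rffi
from rpython.translator.tool.cbuild import ExternalCompilationInfo

eci = ExternalCompilationInfo(
    post_include_bits=[
        'long pypy__skip_whitespace(const char *s, long i, long length);'],
    separate_module_sources=[r"""
        /* Scalar stand-in; a real port of RapidJSON's skipper would use
           SSE2/SSE4.2 intrinsics here instead of a byte-at-a-time loop. */
        long pypy__skip_whitespace(const char *s, long i, long length)
        {
            while (i < length && (s[i] == ' ' || s[i] == '\t' ||
                                  s[i] == '\n' || s[i] == '\r'))
                i++;
            return i;
        }
    """],
)

# Invented binding name; it works on an index into the string rather than
# on raw pointers.
c_skip_whitespace = rffi.llexternal(
    'pypy__skip_whitespace',
    [rffi.CCHARP, rffi.LONG, rffi.LONG], rffi.LONG,
    compilation_info=eci)
```

Calling it from RPython would still need something like rffi.get_nonmovingbuffer to hand the string data to C.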