Research upsides and downsides of different wrapping techniques for Python #179

arthurp · 2021-04-23T15:06:51Z

Tools to look at:

Cython (The current Python "native" standard)
Pybind11 https://pybind11.readthedocs.io/en/stable/ (The current C++ "native" standard)
SWIG (The old standard, kind of a mess)
cppyy https://cppyy.readthedocs.io/en/latest/ (probably not stable enough yet, but interesting)
Custom tools (could provide much more automation than generic tools, but too much effort to be worth it)
Others?

JIRA: https://katanagraph.atlassian.net/browse/ENG-322

insertinterestingnamehere · 2021-04-23T15:37:03Z

Just to add on here, IMHO, the ideal for many numerical applications would revolve around a workflow that goes something like this: library -> wrappers using standardized (WRT a MOP) data structures that are available in each source language -> auto-generated wrapper exposed in any target language via a MOP. None of the existing solutions really do this, though, since no sufficiently general MOP exists for regular/irregular data in computational/data science.

insertinterestingnamehere · 2021-04-23T15:46:43Z

I think the XTensor people have a somewhat similar idea going: they bind stuff to their C++ data structures and then have facilities for quickly adapting those things to be usable in Python/Julia/R/etc. The main idea is just that autogenerating can work great if there's a unifying data layout/style with corresponding library data structures. Translating the idioms of a wrapped library to the desired library data structures and semantics can take place in whatever source language the wrapped library was made for; the export to other languages can then be mostly automated.

arthurp · 2021-09-07T16:50:16Z

I have been thinking about this on and off for a couple of months now and I have a specific plan at this point. I plan to test it before making a final decision, but I think it will work well.

Binding from C++ into Python is done with Pybind11. This supports a lot (see https://pybind11.readthedocs.io/en/stable/classes.html#). However, there will inevitably be API quality issues in the "raw" Pybind11 API exposed in Python that cannot easily be fixed from the C++ side (for instance, working around issues with unique_ptr arguments). Also, to integrate with Numba, we need to have some real Python code, since functions need to have Python bytecode for Numba to compile them.
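
To make the bytecode point concrete, here is a minimal sketch. It uses math.sqrt, a C-implemented standard library function, as a stand-in for a function exported by a compiled binding module (an assumption for illustration; no actual Pybind11 module is involved). The name py_sqrt is hypothetical.

```python
import math

# Stand-in for a function exported by a compiled extension module.
# (Assumption: a real binding would come from a Pybind11-built module.)
raw_sqrt = math.sqrt


def py_sqrt(x):
    """Thin Python-level wrapper around the compiled function."""
    return raw_sqrt(x)


# C-implemented functions carry no Python code object...
raw_has_bytecode = hasattr(raw_sqrt, "__code__")
# ...while a function defined in Python does.
wrapper_has_bytecode = hasattr(py_sqrt, "__code__")

print(raw_has_bytecode, wrapper_has_bytecode)  # prints: False True
```

This only illustrates the distinction between compiled and Python-defined functions. Making the call inside the wrapper something Numba can actually compile is a separate problem that this sketch does not address.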

Many "raw" C++ bindings provided by Pybind11 will need another layer of wrapping at the Python level to provide features that can only be effectively provided there. Hopefully this can be mostly automated with metaprogramming, but some classes may need custom wrapping, especially to handle types from Python libraries we want to interoperate with, like pandas.
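
A rough sketch of what the automated part could look like, under stated assumptions: _RawGraph is a hypothetical pure-Python stand-in for a class exported by the compiled module, and forward_methods and Graph are illustrative names, not part of any existing API. The decorator generates Python-level methods that delegate to the raw object, and the wrapper class adds one hand-written, Python-only convenience method.

```python
import functools


class _RawGraph:
    """Hypothetical stand-in for a class exported by a compiled binding module."""

    def __init__(self, edges):
        self._edges = list(edges)

    def num_edges(self):
        return len(self._edges)

    def out_degree(self, node):
        return sum(1 for src, _ in self._edges if src == node)


def forward_methods(raw_cls, names):
    """Class decorator: generate Python methods that delegate to self._raw."""

    def make_method(name):
        raw_method = getattr(raw_cls, name)

        # Copy the name and docstring from the raw method where available.
        @functools.wraps(raw_method)
        def method(self, *args, **kwargs):
            return getattr(self._raw, name)(*args, **kwargs)

        return method

    def decorate(cls):
        for name in names:
            setattr(cls, name, make_method(name))
        return cls

    return decorate


@forward_methods(_RawGraph, ["num_edges", "out_degree"])
class Graph:
    """Python-level wrapper: generated forwarding methods plus custom extras."""

    def __init__(self, edges):
        self._raw = _RawGraph(edges)

    def degrees(self, nodes):
        # Hand-written method that exists only at the Python level.
        return {n: self.out_degree(n) for n in nodes}


g = Graph([(0, 1), (0, 2), (1, 2)])
print(g.num_edges(), g.degrees([0, 1, 2]))
```

Because the generated methods are ordinary Python functions, they have bytecode, and classes that need special handling can simply define their own methods next to the generated ones.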

arthurp self-assigned this Apr 23, 2021
