MemoryView provides the features to share multidimensional homogeneous arrays of fixed-size element on memory among extension libraries.
-
This feature is still experimental. The specification described here can be changed in the future.
-
This document is under construction. Please refer the master branch of ruby for the latest version of this document.
We sometimes deal with certain kinds of objects that have arrays of the same typed fixed-size elements on a contiguous memory area as its internal representation. Numo::NArray in numo-narray and Magick::Image in rmagick are typical examples of such objects. MemoryView plays the role of the hub to share the internal data of such objects without copy among such libraries.
Copy-less sharing of data is very important in some field such as data analysis, machine learning, and image processing. In these field, people need to handle large amount of on-memory data with several libraries. If we are forced to copy to exchange large data among libraries, a large amount of the data processing time must be occupied by copying data. You can avoid such wasting time by using MemoryView.
MemoryView has two categories of APIs:
-
Producer API
Classes can register own MemoryView entry which allows objects of that classes to expose their MemoryView
-
Consumer API
Consumer API allows us to obtain and manage the MemoryView of an object
A MemoryView structure, rb_memory_view_t
, is used for exporting objects' MemoryView.
This structure contains the reference of the object, which is the owner of the MemoryView, the pointer to the head of exported memory, and the metadata that describes the structure of the memory. The metadata can describe multidimensional arrays with strides.
The MemoryView structure consists of the following members.
-
VALUE obj
The reference to the original object that has the memory exported via the MemoryView.
RubyVM manages the reference count of the MemoryView-exported objects to guard them from the garbage collection. The consumers do not have to struggle to guard this object from GC.
-
void *data
The pointer to the head of the exported memory.
-
ssize_t byte_size
The number of bytes in the memory pointed by
data
. -
bool readonly
true
for readonly memory,false
for writable memory. -
const char *format
A string to describe the format of an element, or NULL for unsigned byte.
-
ssize_t item_size
The number of bytes in each element.
-
const rb_memory_view_item_component_t *item_desc.components
The array of the metadata of the component in an element.
-
size_t item_desc.length
The number of items in
item_desc.components
. -
ssize_t ndim
The number of dimensions.
-
const ssize_t *shape
A
ndim
size array indicating the number of elements in each dimension. This can beNULL
whenndim
is 1. -
const ssize_t *strides
A
ndim
size array indicating the number of bytes to skip to go to the next element in each dimension. This can beNULL
whenndim
is 1. -
const ssize_t *sub_offsets
A
ndim
size array consisting of the offsets in each dimension when the MemoryView exposes a nested array. This can beNULL
when the MemoryView exposes a flat array. -
void *private_data
The private data that MemoryView provider uses internally. This can be
NULL
when any private data is unnecessary.
-
bool rb_memory_view_available_p(VALUE obj)
Return
true
ifobj
supports to export a MemoryView. Returnfalse
otherwise.If this function returns
true
, it doesn't mean the functionrb_memory_view_get
will succeed. -
bool rb_memory_view_get(VALUE obj, rb_memory_view_t *view, int flags)
If the given
obj
supports to export a MemoryView that conforms the givenflags
, this function fillsview
by the information of the MemoryView and returnstrue
. In this case, the reference count ofobj
is increased.If the given combination of
obj
andflags
cannot export a MemoryView, this function returnsfalse
. The content ofview
is not touched in this case.The exported MemoryView must be released by
rb_memory_view_release
when the MemoryView is no longer needed. -
bool rb_memory_view_release(rb_memory_view_t *view)
Release the given MemoryView
view
and decrement the reference count ofview->obj
.Consumers must call this function when the MemoryView is no longer needed. Missing to call this function leads memory leak.
-
ssize_t rb_memory_view_item_size_from_format(const char *format, const char **err)
Calculate the number of bytes occupied by an element.
When the calculation fails, the failed location in
format
is stored intoerr
, and returns-1
. -
void *rb_memory_view_get_item_pointer(rb_memory_view_t *view, const ssize_t *indices)
Calculate the location of the item indicated by the given
indices
. The length ofindices
must equal toview->ndim
. This function initializesview->item_desc
if needed. -
VALUE rb_memory_view_get_item(rb_memory_view_t *view, const ssize_t *indices)
Return the Ruby object representation of the item indicated by the given
indices
. The length ofindices
must equal toview->ndim
. This function usesrb_memory_view_get_item_pointer
. -
rb_memory_view_init_as_byte_array(rb_memory_view_t *view, VALUE obj, void *data, const ssize_t len, const bool readonly)
Fill the members of
view
as an 1-dimensional byte array. -
void rb_memory_view_fill_contiguous_strides(const ssize_t ndim, const ssize_t item_size, const ssize_t *const shape, const bool row_major_p, ssize_t *const strides)
Fill the
strides
array with byte-Strides of a contiguous array of the given shape with the given element size. -
void rb_memory_view_prepare_item_desc(rb_memory_view_t *view)
Fill the
item_desc
member ofview
. -
bool rb_memory_view_is_contiguous(const rb_memory_view_t *view)
Return
true
if the data in the MemoryViewview
is row-major or column-major contiguous.Return
false
otherwise. -
bool rb_memory_view_is_row_major_contiguous(const rb_memory_view_t *view)
Return
true
if the data in the MemoryViewview
is row-major contiguous.Return
false
otherwise. -
bool rb_memory_view_is_column_major_contiguous(const rb_memory_view_t *view)
Return
true
if the data in the MemoryViewview
is column-major contiguous.Return
false
otherwise.