HDF5 wrapper

HDF5 Installation

Download the tar ball from the official homepage. Make sure to always use the latest version (currently: 1.12.2). Older versions (1.8 and below) may result in unintentional problems, especially with non-native datatypes like logical. Installation for e.g. gnu compiler is done via

tar xf hdf5-x.y.z.tar.gz
cd hdf5-x.y.z
CC=gcc
FC=gfortran
mkdir /home/USER/opt
./configure --enable-fortran --prefix=/home/USER/opt/hdf5-x.y.z_gcc
make
make check
make install

For more detailed library information please refer to [Libraries and tools reference].

Wrapper installation

Compile the file hdf5_wrapper.F90 with proper linking to your local hdf5 library. In your program use the binding use hdf5_wrapper to use the wrapper (see test_example.f90). A minimalistic compilation script can be found in compile_gfortran.sh and compile_ifort.sh.

General thoughts

This wrapper tries to mimic unix-like behavior: Group and dataset access is done with typical unix-like path formalism (/ is the root folder; nested groups can be created via /group1/group2) similarly to the Python library h5py. The data read-in routines can be used by simply providing allocatable data arrays of the most commonly used datatypes in Fortran (see below). Please note that there are no dimensionality or datatype checks. While the majority of routines are subroutines (call keyword required) some are simple functions which either return logical or integer datatypes. These are marked accordingly in Summary of commands.

Implicit data transposition

One of the biggest pitfalls when handling hdf5 files with Fortran is the implicit data transposition. While languages like C and Python employ row-major ordering, Fortran employs column-major ordering. This must be kept in mind when interfacing these languages. Creating a three-dimensional dataset with a shape of [4, 8, 2] via this wrapper will naturally result in an array shape with a full transposition [2, 8, 4] in C, Python, and tools like h5ls provided by the HDF5 library.

A few handy commands

Inspecting hdf5 files from the shell can easily be done with h5ls. Some helpful flags include

h5ls -lr: (r)ecursively (l)ist all groups and datasets
h5ls -vlr: (v)erbosely and (r)ecursively (l)ist everything (this includes attributes, datatypes, etc.)
h5ls -d: inspect the (d)ata directly by simply appending the full unix-like path after the file without space (e.g. file.hdf5/group1/dset)

HDF5 interface and file handling

hdf5 command	description
`hdf5_init()`	initialize hdf5 interface
`hdf5_finalize()`	close hdf5 interface
`hdf5_create_file(fname)`	create hdf5 file
`hdf5_open_file(fname, ifile [,rdonly])`	open hdf5 file
`hdf5_close_file(ifile)`	close hdf5 file

fname represents a Fortran string (character array) while ifile is a an integer of the integer(hid_t) (hence we also need to make the hdf5 library available outside the wrapper).

program filehandling
	use hdf5
	use hdf5_wrapper

	integer(hid_t) :: ifile

	call hdf5_init()
	call hdf5_create_file('test.hdf5')
	call hdf5_open_file('test.hdf5', ifile)
	call hdf5_close_file(ifile)
	call hdf5_finalize()
end program

Groups

hdf5 command	description
`hdf5_create_group(ifile, gname)`	create group(s) (parents included)
`hdf5_list_groups(ifile, gname)`	get list of groups
`hdf5_get_number_groups(ifile, gname)`	get number of lists
`hdf5_group_exists(ifile, gname)`	check if specific group exists

We are able to create groups by simply providing the file identifier ifile and the full (unix-like) path. The wrapper checks for the existence of parent groups and creates them if they do not exist.

program groups
	use hdf5
	use hdf5_wrapper

	integer(hid_t) :: ifile
	integer :: ngroups
	character(len=100), allocatable :: list(:)

	call hdf5_init()
	call hdf5_create_file('test.hdf5')
	call hdf5_open_file('test.hdf5', ifile)

	call hdf5_create_group(ifile, '/group1/group2/group3')
	call hdf5_create_group(ifile, '/group1/group4')
	call hdf5_create_group(ifile, '/group1/group5')

	write(*,*) hdf5_group_exists(ifile, 'group1/group2')

 	! the list array gets allocated and afterwards contains the group names
	call hdf5_list_groups(ifile, '/group1', list)
	ngroups = hdf5_get_number_groups(ifile, '/group1') ! 3

	call hdf5_close_file(ifile)
	call hdf5_finalize()
end program

The resulting file can be inspected by h5ls -lr.

Datasets

hdf5 command	description
`hdf5_read_data(ifile, dname, adata)`	read data from dataset
`hdf5_write_data(ifile, dname, data)`	write datasets from arrays
`hdf5_list_datasets(ifile, group, list)`	get list of datasets
`hdf5_get_number_datasets(ifile, group)`	get number of datasets
`hdf5_dataset_exists(ifile, group)`	check if specific dataset exists
`hdf5_get_dimensions(ifile, dataset)`	get number of dimensions
`hdf5_get_shape(ifile, dataset, shape_array)`	get shape in form of an array

The dataset functions work in the same way as the group functions. One can create datasets with a full (unix-like) path where, again, the non-existent parent groups are created on the fly. We provide wrappers for the following datatypes for datasets:

logical
integer
real(4)
real(8)
complex(4)
complex(8)

The in/output is supported for all possible 0D to 7D (Fortran maximum) arrays. The readin functions work the same way, only that we have to provide an allocatable array with the matching dimensionality instead. Please note that there is neither a check for matching datatypes nor a check for matching dimensions. From the above list HDF5 natively supports only integer, real(4), and real(8). logical, complex(4), and complex(8) on the other hand must be constructed manually. In order to achieve maximum compatibility with h5py (supported datatypes) the identical structures are employed:

hdf5 datatype	internal
`logical`	HDF5 enum (0: FALSE, 1: TRUE) - h5t_native_b8
`integer`	h5t_native_integer
`real(4)`	h5t_native_real
`real(8)`	h5t_native_double
`complex(4)`	HDF5 struct ("r", "i") - h5t_native_real
`complex(8)`	HDF5 struct ("r", "i") - h5t_native_double

program datasets
	use hdf5
	use hdf5_wrapper

	complex(8) :: a(3,5,2) = (1.2d0, 0.2d0)
	complex(8), allocatable :: b(:,:,:)
	integer(hid_t) :: ifile
	integer :: ngroups, dimensions
	character(len=100), allocatable :: list(:)
	integer, allocatable :: hdf_shape(:)

	call hdf5_init()
	call hdf5_create_file('test.hdf5')
	call hdf5_open_file('test.hdf5', ifile)

	call hdf5_write_data(ifile, '/group1/group2/dataset', a)
	write(*,*) hdf5_data_exists(ifile, '/group1/group2/dataset')

	call hdf5_read_data(ifile, '/group1/group2/dataset', b)

	! same as for the groups
	call hdf5_list_datasets(ifile, '/group1/group2', list)
	ngroups = hdf5_get_number_datasets(ifile, '/group1/group2') ! 1

	! dimensions and shape
	call hdf5_get_shape(ifile, '/group1/group2/dataset', hdf_shape)
	dimensions = hdf5_get_dimensions(ifile, '/group1/group2/dataset')

	call hdf5_close_file(ifile)
	call hdf5_finalize()
end program

Attributes

hdf5 command	description
`hdf5_write_attribute(ifile, location, attr_name, attribute)`	write attribute
`hdf5_read_attribute(ifile, location, attr_name, variable)`	read attribute
`hdf5_attribute_exists(ifile, location, attr_name)`	check if specific attribute exists
`hdf5_get_number_attributes(ifile, location)`	get number of attributes
`hdf5_list_attributes(ifile, location, list)`	get list of attributes

Attributes additionally support strings of variable size:

logical
integer
real(4)
real(8)
complex(4)
complex(8)
character(len=*)

Attributes can be attached to both groups and datasets. For this reason we forgo automatic parent group creation. Any non-existence of objects will trigger an error. Please note that the written strings will be byte strings within Python.

program attributes
	use hdf5
	use hdf5_wrapper

	integer    :: x = 3
	real(8)    :: y

	call hdf5_init()
	call hdf5_create_file('test.hdf5')
	call hdf5_open_file('test.hdf5', ifile)

	call hdf5_write_data(ifile, '/group1/dataset', x)

	! attach to root
	call hdf5_write_attribute(ifile, '/', 'attr_root', .true.)
	! attach to existing group
	call hdf5_write_attribute(ifile, '/group1', 'attr_name1', 1.23d0)
	! attach to existing dataset
	call hdf5_write_attribute(ifile, '/group1/dataset', 'attr_name2', 'astring')

	! read the above attribute into the variable y
	call hdf5_read_attribute(ifile, '/group1', 'attr_name1', y)

	call hdf5_close_file(ifile)
	call hdf5_finalize()
end program

Deletion

hdf5 command	description
`hdf5_delete(ifile, location)`	delete group/dataset
`hdf5_delete_attribute(ifile, location, attr_name)`	delete attribute

Deletion of datasets and groups is internally done simply by unlinking the objects.

Summary of commands

command	description	function
`hdf5_init()`	initialize hdf5 interface	-
`hdf5_finalize()`	close hdf5 interface	-
`hdf5_create_file(fname)`	create hdf5 file	-
`hdf5_open_file(fname, ifile [,rdonly])`	open hdf5 file	-
`hdf5_close_file(ifile)`	close hdf5 file	-
`hdf5_create_group(ifile, gname)`	create group(s) (parents included)	-
`hdf5_list_groups(ifile, gname)`	get list of groups	-
`hdf5_get_number_groups(ifile, gname)`	get number of lists	yes (integer)
`hdf5_group_exists(ifile, gname)`	check if group exists	yes (logical)
`hdf5_read_data(ifile, dname, adata)`	read data from dataset	-
`hdf5_write_data(ifile, dname, data)`	write datasets from arrays	-
`hdf5_dataset_exists(ifile, group)`	check if dataset exists	yes (logical)
`hdf5_list_datasets(ifile, group, list)`	get list of datasets	-
`hdf5_get_number_datasets(ifile, group)`	get number of datasets	yes (integer)
`hdf5_get_dimensions(ifile, dataset)`	get number of dimensions	yes (integer)
`hdf5_get_shape(ifile, dataset, shape_array)`	get shape in form of an array	-
`hdf5_write_attribute(ifile, location, attr_name, attribute)`	write attribute	-
`hdf5_read_attribute(ifile, location, attr_name, variable)`	read attribute	-
`hdf5_attribute_exists(ifile, location, attr_name)`	check if attribute exists	yes (logical)
`hdf5_get_number_attributes(ifile, location)`	get number of attributes	yes (integer)
`hdf5_list_attributes(ifile, location, list)`	get list of attributes	-
`hdf5_delete(ifile, location)`	delete group/dataset	-
`hdf5_delete_attribute(ifile, location, attr_name)`	delete attribute	-

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

HDF5 wrapper

HDF5 Installation

Wrapper installation

General thoughts

Implicit data transposition

A few handy commands

HDF5 interface and file handling

Groups

Datasets

Attributes

Deletion

Summary of commands

Files

README.md

Latest commit

History

README.md

File metadata and controls

HDF5 wrapper

HDF5 Installation

Wrapper installation

General thoughts

Implicit data transposition

A few handy commands

HDF5 interface and file handling

Groups

Datasets

Attributes

Deletion

Summary of commands