Nested storing of chunks #19

SabineEmbacher (Maintainer) · 2021-02-09T18:34:34Z

If chunks are stored flat (i.e. chunk 0.0.0 next to chunk 1.0.0, etc.), a single directory can end up holding a very large number of chunk files. This can slow down a file system (e.g. a local file system) considerably.
To prevent this, there should be a way to store the chunk files in nested directories.

Writing can be easily implemented by telling the array when it is created whether chunks are to be stored flat or nested.
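
As a minimal sketch of the write side: the two naming schemes differ only in the separator placed between the chunk indices. The function name `chunk_key` and its `nested` flag are hypothetical and are not taken from jZarr or zarr-python.

```python
def chunk_key(indices, nested):
    """Build the storage key for one chunk from its grid indices."""
    separator = "/" if nested else "."
    return separator.join(str(i) for i in indices)


flat_key = chunk_key((1, 4, 3), nested=False)    # "1.4.3"
nested_key = chunk_key((1, 4, 3), nested=True)   # "1/4/3"
```

An array that is told its layout at creation time would only need to pass the matching flag on every write.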

When reading, it becomes more difficult.
If the .zarray file remains unchanged and contains no information about whether chunks were written flat or nested, the directory must be searched for chunk files in order to decide whether to read flat or nested.
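
Such a directory search could look roughly like the sketch below. It assumes a local directory per array; the function name `detect_layout_by_scan` and the `ndim` parameter are hypothetical. Note that it may have to walk many directory entries before it finds a chunk, which is exactly the cost in question.

```python
import os


def detect_layout_by_scan(array_dir, ndim):
    """Return "nested", "flat", or None when no chunk file is found.

    For ndim == 1 both layouts produce the same names, so "flat" is reported.
    """
    with os.scandir(array_dir) as entries:
        for entry in entries:
            parts = entry.name.split(".")
            # A flat chunk file is named with ndim dot-separated integers.
            if entry.is_file() and len(parts) == ndim and all(p.isdigit() for p in parts):
                return "flat"
            # A nested layout starts with a directory named after the first index.
            if entry.is_dir() and entry.name.isdigit() and ndim > 1:
                return "nested"
    return None
```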

Even if the .zarray file written by jZarr records whether chunk files are stored flat or nested, the problem persists as soon as one reads an array that was written not by jZarr but by Python zarr, which left no such information in the .zarray file even though the chunks were written nested.

The problem could be solved by having the chunk name generator of the FileSystemStore always return an array of two chunk file names for as long as the status is unresolved.
The first name is the nested chunk name, e.g. "1/4/3".
The second name is the flat chunk name, e.g. "1.4.3".
The FileSystemStore then always tries to open a FileInputStream with the nested name first.
If this succeeds, the status is reported to the array, the .zarray file is adjusted (if write access exists) and the chunk name generator is replaced by a nested name generator.
If instead a chunk file with the flat name is found, the same happens the other way round and a flat name generator is installed.
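
The probing idea above can be sketched in Python as follows. This is an illustration under assumptions, not jZarr code: `ProbingChunkNames` and `read_chunk` are hypothetical names, a plain file open stands in for the FileInputStream, and the step that rewrites the .zarray file is left out.

```python
import os


class ProbingChunkNames:
    """Returns both candidate names until the layout is known, then only one."""

    def __init__(self):
        self.layout = None  # None means "unresolved"

    def names(self, indices):
        nested = "/".join(str(i) for i in indices)
        flat = ".".join(str(i) for i in indices)
        if self.layout == "nested":
            return [nested]
        if self.layout == "flat":
            return [flat]
        return [nested, flat]  # nested is tried first, as proposed


def read_chunk(array_dir, indices, name_generator):
    """Return the chunk bytes, or None if no file exists for the chunk."""
    for name in name_generator.names(indices):
        path = os.path.join(array_dir, *name.split("/"))
        try:
            with open(path, "rb") as stream:
                data = stream.read()
        except (FileNotFoundError, NotADirectoryError):
            continue
        # The first hit decides the layout for all later reads.
        name_generator.layout = "nested" if "/" in name else "flat"
        return data
    return None
```

After the first successful read, `names` returns a single candidate, so the extra file-open attempt is only paid while the layout is still unresolved.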

Then there is still another edge case.
What if we open a zarr array that was initialized by someone else, in order to add data to it, but no chunks have been written yet?
If the .zarray file contains no information about whether chunks should be written flat or nested, it would be possible to calculate how many chunk files the array would produce and use a threshold value to decide whether to write flat or nested. Such a threshold value could be specified as a VM property.
Alternatively, a default behavior for such cases could be defined,
e.g. default is flat, as in the zarr v2 specification.
This default behaviour could also be overridden by a VM parameter at Java application start.
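
The threshold idea can be sketched as below. Everything named here is an assumption for illustration: the environment variable `ZARR_NESTED_THRESHOLD` stands in for the VM property, and the default of 10,000 is an arbitrary placeholder rather than a measured value.

```python
import math
import os


def chunk_count(shape, chunks):
    """Number of chunk files a fully written array would produce."""
    # -(-s // c) is integer ceiling division: partial edge chunks count too.
    return math.prod(-(-s // c) for s, c in zip(shape, chunks))


def choose_layout(shape, chunks, default_threshold=10_000):
    """Pick "nested" above the threshold, otherwise "flat"."""
    # Hypothetical setting; plays the role of the VM property.
    threshold = int(os.environ.get("ZARR_NESTED_THRESHOLD", default_threshold))
    return "nested" if chunk_count(shape, chunks) > threshold else "flat"
```

For example, an array of shape (40, 50, 60, 70) with chunks of (5, 5, 5, 5) has 8 × 10 × 12 × 14 = 13440 chunks, so it would be written nested under the placeholder default.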

What do you think about it?
Good or not good?
Have I missed any cases?
Other suggestions?

joshmoore · 2021-02-09T20:05:17Z

A few evening additions:

the iterate suggestion is interesting, but as with my suggestion as well, it'll be interesting to see what the overhead is in practice.
if we come up with a viable solution then I could certainly see implementing in zarr-python as well.
there was some discussion in https://gitter.im/zarr-developers/community with at least one vote for just going to "/" now ;)
the above would likely help inform whether or not an "automatic switch" would be necessary/useful. (E.g. one of the arguments was that "/" will always perform better on S3)

1 reply

sbesson Feb 9, 2021

A few additional thoughts, mostly inspired by our experience with a file format specification (OME-TIFF) as well as with supporting reading/writing implementations (Bio-Formats + OME Files):

completely agree with the asymmetry discussed above. While the writing should be unequivocally compliant, strict and include as much essential metadata as possible, the reading part will typically handle more lenient use cases
the frontier between what should or should not be supported in these reading libraries sometimes becomes very thin. The tension typically arises between trying to support as many real (and valuable) datasets as possible on the one hand, and staying faithful to the specification and keeping the library performant and maintainable on the other
different internal layouts are an inherent part of the evolution of binary containers. In the OME-TIFF land, the most recent example was the introduction of SubIFDs to support 2D pyramidal resolutions, which raised a lot of internal discussion around backwards compatibility.

In general, my vote would be to solve this type of scenario via metadata whenever possible, i.e. if multiple chunk layouts (nested vs flat) are supported:

most importantly, define common metadata semantics to specify and store the chunk layout, and include them in the Zarr (v3?) specification
given the potential implications for libraries, this metadata should probably be recommended in the RFC 2119 sense
work through all the implementations to support the systematic writing of this metadata as suggested above
while reading Zarr data where this metadata is not present, having some logic allowing the automatic detection makes sense, with the performance caveats discussed above. Depending on the way this is specified, it might make sense to have the code warn about the lack of metadata specifying the chunk layout
assuming there is some substantial overhead associated with the layout detection, there might be some value in having an API to enable/disable a strict reading mode, which would fail if the metadata is not specified.
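
The last two points could be combined as in the following sketch. The metadata key `chunk_layout`, the `strict` flag and the `MissingLayoutError` class are all hypothetical names chosen for illustration; none of them comes from a Zarr specification or an existing library.

```python
import warnings


class MissingLayoutError(Exception):
    """Raised in strict mode when the metadata does not state the chunk layout."""


def resolve_layout(metadata, detect, strict=False):
    """Return the chunk layout, preferring metadata over detection.

    metadata: dict parsed from the array metadata file.
    detect:   zero-argument callable that inspects the store and returns a layout.
    """
    layout = metadata.get("chunk_layout")  # hypothetical key, not part of any spec
    if layout is not None:
        return layout
    if strict:
        raise MissingLayoutError("chunk layout is not specified in the metadata")
    warnings.warn("chunk layout missing from metadata; falling back to detection")
    return detect()
```

In strict mode the potentially expensive `detect` callable is never invoked, which is the point of offering the switch.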

SabineEmbacher (Maintainer, Author) · 2021-02-10T07:22:45Z

I thought of another possibility that would allow the .zarray file to remain unchanged and still be able to quickly find out whether the chunks were written nested or flat.

For this, one thing should be kept in mind from the beginning.
A storage is a key(string)/value(bytes) mapping. How the key is handled internally is ultimately just an implementation detail of a storage implementation.
See https://zarr.readthedocs.io/en/stable/api/storage.html

0-position pixel strategy

For this, a 0-position pixel with fill value would always have to be written when creating a zarr array.

Python initialisation example for a 4 dimensional array:

import zarr

fill_value = 0  # assumed value for this example
a = zarr.create(shape=(40, 50, 60, 70), chunks=(5, 5, 5, 5), fill_value=fill_value)
# next line should then be an internal initialisation step (part of create)
a[0, 0, 0, 0] = fill_value

When opening an array to read from it, an initialisation step could be to read the 0-position pixel. This initialisation step would then only need to iterate once over the possible key types and initialise the array with the correct chunk name creator.

This zarr array "create" initialisation would be easy to implement in any zarr implementation language, as everything needed for this is already available.

Only the zarr array "open for reading" initialisation would have to iterate through the possible chunk name types once, when reading the 0-position pixel, to set the correct chunk key generator in the array.

I think, this procedure should work for any storage implementation, because it is ultimately only a key string.
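
The "open for reading" step described above might look like this minimal sketch. It treats the storage as a plain key/value mapping (a `dict` here); the function names are hypothetical, and it assumes the chunk at grid position 0 was written when the array was created.

```python
def zero_chunk_keys(ndim):
    """Candidate keys for the chunk at grid position 0, one per naming scheme."""
    zeros = ["0"] * ndim
    return {"nested": "/".join(zeros), "flat": ".".join(zeros)}


def detect_layout_from_zero_chunk(store, ndim):
    """Return the layout whose 0-position key exists in the store, else None."""
    for layout, key in zero_chunk_keys(ndim).items():
        if key in store:
            return layout
    return None


store = {".zarray": b"{}", "0/0/0/0": b"\x00"}
layout = detect_layout_from_zero_chunk(store, ndim=4)  # "nested"
```

The lookup costs at most one key test per naming scheme, independent of how many chunks the array holds.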
