Core utilities to serve HDF5 file contents

h5grove, core utilities to serve HDF5 file contents

h5grove is a Python package that provides utilities to design backends serving HDF5 file content: attributes, metadata and data. HDF5 files are accessed with h5py.

Rationale

Several packages can already serve HDF5 files. However, each is tailored to its own use case and tied to a single implementation, which hampers reuse.

In addition, the same problems come up again and again when designing HDF5 backends. To name a few:

  • Resolving external links
  • Dealing with compression and slicing of datasets
  • Encoding data efficiently and consistently (looking at you, NaN and Infinity in JSON)

h5grove aims to provide building blocks that solve these common problems and can be reused in existing or new backends.
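
To make the JSON problem from the list above concrete, here is a minimal sketch using only the Python standard library. It shows that json.dumps emits the non-standard tokens NaN and Infinity by default, and one possible workaround (replacing non-finite values with null). The sanitize helper is hypothetical and written for illustration only; it is not necessarily how h5grove handles these values.

```python
import json
import math

values = [1.0, float("nan"), float("inf")]

# By default, Python emits the tokens NaN and Infinity, which are not valid JSON.
lenient = json.dumps(values)

# With allow_nan=False, the encoder raises instead of emitting invalid JSON.
try:
    json.dumps(values, allow_nan=False)
    strict_error = None
except ValueError as exc:
    strict_error = exc


def sanitize(value):
    """Hypothetical helper: recursively replace non-finite floats with None."""
    if isinstance(value, float) and not math.isfinite(value):
        return None
    if isinstance(value, (list, tuple)):
        return [sanitize(item) for item in value]
    return value


# None is serialized as null, so the output is valid JSON.
safe = json.dumps(sanitize(values), allow_nan=False)

print(lenient)
print(safe)
```

Replacing non-finite values with null loses the distinction between NaN and the infinities, so a backend may prefer a binary encoding when that distinction matters.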

Installation

pip install h5grove

The low-level utilities of h5grove can be used with any backend implementation. Additional utilities for Flask and Tornado are available as optional extras (the quotes keep shells such as zsh from interpreting the brackets):

pip install "h5grove[flask]" # For Flask
pip install "h5grove[tornado]" # For Tornado

Contents

Example implementations using Flask and Tornado are given in the example folder. These are functional backends that make use of the utilities provided by the h5grove package.

For more tailored use, you can make use of the low-level utilities in your own project. The package contains the following modules:

  • content: A hierarchy of Content classes that extract the relevant information (be it attributes, metadata or data) from the file (resolving links if possible) and expose it through methods.

Ideally, getting the information from a path in the file should be as simple as:

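import h5py
# Assumption: create_content is importable from the content module described above
from h5grove.content import create_content

# filepath is the path of the HDF5 file, path the location of the entity inside it
with h5py.File(filepath, "r") as h5file:
    content = create_content(h5file, path)
    # Get metadata (valid for all entities)
    content.metadata()
    # Get data (only valid for datasets)
    content.data()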

  • encoders: Functions that encode data and provide the appropriate headers to build request responses. The module provides a JSON encoder and a binary encoder.
  • flaskutils: Utilities dedicated to Flask backends. Notably provides a Blueprint.
  • tornadoutils: Utilities dedicated to Tornado backends.
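
To illustrate what "encoding data and providing the appropriate headers" can look like, here is a rough sketch based only on the standard library. The encode function, its fmt parameter and the chosen binary layout (little-endian float64) are assumptions made for this example; they do not reproduce the actual API of the encoders module.

```python
import json
import struct
from typing import Dict, Sequence, Tuple


def encode(values: Sequence[float], fmt: str = "json") -> Tuple[bytes, Dict[str, str]]:
    """Hypothetical encoder: return the response body and the matching headers."""
    if fmt == "json":
        body = json.dumps(list(values)).encode("utf-8")
        content_type = "application/json"
    elif fmt == "bin":
        # Little-endian float64 values, packed back to back.
        body = struct.pack(f"<{len(values)}d", *values)
        content_type = "application/octet-stream"
    else:
        raise ValueError(f"Unsupported format: {fmt!r}")
    headers = {"Content-Type": content_type, "Content-Length": str(len(body))}
    return body, headers


json_body, json_headers = encode([1.0, 2.5], "json")
bin_body, bin_headers = encode([1.0, 2.5], "bin")

print(json_body, json_headers)
print(len(bin_body), bin_headers)
```

Returning the body together with its headers keeps the encoder independent of the web framework: a Flask or Tornado handler only has to copy both into its own response object.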
