A consistent approach to file operations, anywhere.

Cabinets

cabinets is a Python library that provides a consistent interface for file operations across multiple storage platforms. File extensions are dynamically detected to allow automatic serialization and deserialization of Python objects. cabinets supports a variety of protocols and file format parsers natively, and new protocols or parsers can be easily registered.
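
To make the idea of dispatching on protocol and file extension concrete, here is a minimal standard-library sketch of how a path such as `file://test.json` could be split into its parts. The helper name `split_uri` is hypothetical and is not part of cabinets; the library's actual parsing logic may differ.

```python
import posixpath
from typing import Tuple


def split_uri(uri: str) -> Tuple[str, str, str]:
    """Split 'protocol://path/name.ext' into (protocol, path, extension).

    Hypothetical helper for illustration only; not the cabinets API.
    """
    protocol, sep, path = uri.partition('://')
    if not sep:
        raise ValueError(f"missing protocol prefix in {uri!r}")
    # splitext returns the extension with its leading dot, e.g. '.json'
    ext = posixpath.splitext(path)[1].lstrip('.').lower()
    return protocol, path, ext


print(split_uri('file://test.json'))         # ('file', 'test.json', 'json')
print(split_uri('s3://bucket/config.YAML'))  # ('s3', 'bucket/config.YAML', 'yaml')
```

The protocol would then select the storage backend, and the extension would select the parser.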

Sample Usage

Read a file

Set up a test file in your local filesystem:

import json

obj = {'test': 1}

with open('test.json', 'w') as fh:
    json.dump(obj, fh)

Read back and parse the file using cabinets:

from cabinets import Cabinets

new_obj = Cabinets.read('file://test.json')

assert new_obj == obj

That's it! The file is loaded and parsed in just one line.

Write a file

cabinets also supports creating files. The first example can be rewritten using only cabinets:

from cabinets import Cabinets

obj = {'test': 1}

Cabinets.create('file://test.json', obj)

new_obj = Cabinets.read('file://test.json')

assert new_obj == obj

Built-in Protocols and Parsers

Protocols

Local File System (file://)
S3 (s3://)

Parsers

YAML (.yml, .yaml)
JSON (.json)
Python Pickle (.pickle)
CSV (beta) (.csv)

Custom Protocols and Parsers

cabinets is designed to be fully extensible. If your storage platform or file format is not listed above, you can still use it with cabinets by registering a new protocol or parser.

Adding a Parser

Adding a new parser is as simple as subclassing cabinets.parser.Parser and registering the associated file extensions.

from typing import Any
from cabinets.parser import Parser, register_extensions


@register_extensions('foo', 'bar')
class FooParser(Parser):

    @classmethod
    def _load_content(cls, content: bytes) -> Any:
        return deserialize_foo(content)  # custom deserialization logic

    @classmethod
    def _dump_content(cls, data: Any) -> bytes:
        return serialize_foo(data)  # custom serialization logic

Then, to load a test.foo file, simply use Cabinets.read.

NOTE: In order for the extension to be registered, the class definition must be run at least once. Make sure the modules where your custom Parser classes are defined are imported somewhere before they are used.

from cabinets import Cabinets

# .foo file in local filesystem
local_foo_data = Cabinets.read('file://test.foo')

# .foo file in S3
s3_foo_data = Cabinets.read('s3://test.foo')
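
The note about importing the module before use follows from how class decorators work: the decorator body only runs when the class definition is executed. The sketch below illustrates that registry pattern with the standard library alone. The names `BaseParser`, `register`, `_PARSERS`, and `parser_for` are hypothetical stand-ins, not the cabinets implementation.

```python
import json
from typing import Any, Callable, Dict, Type


class BaseParser:
    """Hypothetical stand-in for a parser base class."""

    @classmethod
    def load(cls, content: bytes) -> Any:
        raise NotImplementedError

    @classmethod
    def dump(cls, data: Any) -> bytes:
        raise NotImplementedError


# Hypothetical registry mapping a file extension to its parser class.
_PARSERS: Dict[str, Type[BaseParser]] = {}


def register(*extensions: str) -> Callable[[Type[BaseParser]], Type[BaseParser]]:
    """Class decorator that records the class under each given extension."""
    def decorator(parser_cls: Type[BaseParser]) -> Type[BaseParser]:
        # Runs once, at the moment the decorated class is defined.
        for ext in extensions:
            _PARSERS[ext] = parser_cls
        return parser_cls
    return decorator


@register('json')
class JsonParser(BaseParser):
    @classmethod
    def load(cls, content: bytes) -> Any:
        return json.loads(content.decode('utf-8'))

    @classmethod
    def dump(cls, data: Any) -> bytes:
        return json.dumps(data).encode('utf-8')


def parser_for(filename: str) -> Type[BaseParser]:
    """Look up the parser by the text after the last dot in the filename."""
    return _PARSERS[filename.rsplit('.', 1)[-1]]


raw = parser_for('test.json').dump({'test': 1})
assert parser_for('test.json').load(raw) == {'test': 1}
```

If the module defining `JsonParser` were never imported, `_PARSERS` would stay empty and the lookup would raise `KeyError`, which is the failure mode the note warns about.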

Protocol Configuration

Some storage protocols require configuration parameters to be set before they can be used. Each Cabinet subclass can expose a set_configuration(**config) classmethod to take care of any required initial setup.

from cabinets.protocols.s3 import S3Cabinet

# set the AWS S3 region to us-west-2 and specify an access key
S3Cabinet.set_configuration(region_name='us-west-2', aws_access_key_id=...)

# use specific Cabinet to avoid protocol prefix
S3Cabinet.read('bucket-in-us-west-2/test.json')
# or use generic Cabinet with protocol prefix
from cabinets.cabinet import Cabinets
Cabinets.read('s3://bucket-in-us-west-2/test.json')

See the documentation of specific Cabinet classes for what configuration parameters are available.
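
As a rough illustration of the classmethod-based configuration described above, the following sketch shows one way per-class settings could be stored so that each subclass keeps its own values. The classes `ConfigurableStore`, `WestStore`, and `EastStore` and the `describe` method are hypothetical; cabinets may implement `set_configuration` differently.

```python
from typing import Any, Dict


class ConfigurableStore:
    """Hypothetical base class; not part of cabinets."""

    _config: Dict[str, Any] = {}

    @classmethod
    def set_configuration(cls, **config: Any) -> None:
        # Rebind instead of mutating so subclasses do not share one dict.
        cls._config = {**cls._config, **config}

    @classmethod
    def describe(cls) -> str:
        return ', '.join(f'{k}={v}' for k, v in sorted(cls._config.items()))


class WestStore(ConfigurableStore):
    pass


class EastStore(ConfigurableStore):
    pass


WestStore.set_configuration(region_name='us-west-2')
EastStore.set_configuration(region_name='us-east-1')

print(WestStore.describe())  # region_name=us-west-2
print(EastStore.describe())  # region_name=us-east-1
```

Because the configuration lives on the class, it is set once and applies to every later call made through that class.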

Release history

0.7.0 (Apr 29, 2022)
0.6.1 (Apr 26, 2021)
0.6.0 (Feb 20, 2021)
0.5.1 (Feb 17, 2021)
0.5.0 (Feb 17, 2021)
0.4.0 (Jan 22, 2021)
0.3.0 (Jan 20, 2021)
0.2.1 (Jan 16, 2021)
0.2.0 (Jan 16, 2021, this version)
0.1.3 (Jan 16, 2021)
0.1.2 (Jan 16, 2021)
0.1.1 (Jan 16, 2021)
0.1.0 (Jan 15, 2021)

Download files

Source Distribution

cabinets-0.2.0.tar.gz (21.7 kB)

Uploaded Jan 16, 2021 Source

Built Distribution

cabinets-0.2.0-py3-none-any.whl (20.5 kB)

Uploaded Jan 16, 2021 Python 3

Hashes for cabinets-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`47cc8cebdd5e11e6fd60341a445925295f800d2e74cf818fb41ab9d6d274d339`
MD5	`f1377093a3a5a1eb3b1b4a223f3c2198`
BLAKE2b-256	`a2aad1d49f4230b5af9ced4e4e52568f84526ffea61d4e3054e07813d30bf3f6`

Hashes for cabinets-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c28fdc8910e052d5d0dec8be220200cebb5683465dd7d7cdbfa8c428a4dbfc50`
MD5	`d899c42cda36bb6ec99cee601eeeec7f`
BLAKE2b-256	`585afb4f561e5a205256b396a02e1013d45bc7d341d1eebd5758b2d5e11e533a`