Skip to main content

A scalable, fast, ACID-compliant Data Catalog powered by Ray.

Project description

DeltaCAT

DeltaCAT is a Pythonic Data Catalog powered by Ray.

Its data storage model allows you to define and manage fast, scalable, ACID-compliant data catalogs through git-like stage/commit APIs, and has been used to successfully host exabyte-scale enterprise data lakes.

DeltaCAT uses the Ray distributed compute framework together with Apache Arrow for common table management tasks, including petabyte-scale change-data-capture, data consistency checks, and table repair.

Getting Started

Install

pip install deltacat

Running Tests

pip3 install virtualenv
virtualenv test_env
source test_env/bin/activate
pip3 install -r requirements.txt

pytest

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deltacat-0.2.10.tar.gz (166.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

deltacat-0.2.10-py3-none-any.whl (245.9 kB view details)

Uploaded Python 3

File details

Details for the file deltacat-0.2.10.tar.gz.

File metadata

  • Download URL: deltacat-0.2.10.tar.gz
  • Upload date:
  • Size: 166.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for deltacat-0.2.10.tar.gz
Algorithm Hash digest
SHA256 7416f62b586baa474056470186e550bb23f2a789cf4cd2c8748bc686fd754720
MD5 4f4ff551a84348c28d2c1dd42bad04ae
BLAKE2b-256 20a46aaf25783f44d4989bac22dfed87a2dafdd2c3f8e14f47c89515aa0f69d5

See more details on using hashes here.

File details

Details for the file deltacat-0.2.10-py3-none-any.whl.

File metadata

  • Download URL: deltacat-0.2.10-py3-none-any.whl
  • Upload date:
  • Size: 245.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.8

File hashes

Hashes for deltacat-0.2.10-py3-none-any.whl
Algorithm Hash digest
SHA256 db4021e5bf1cf7d535c58c07fc7070c4cd5d30704e7c6c62e5bc573ff368157d
MD5 8f9067fef665d4b6c03de8cb9eea327d
BLAKE2b-256 404cda0551a162dba26dba9d546f6f42d9f5dc5add486e5821a08cf0f716200b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page