Skip to main content

`datapipe` is a realtime incremental ETL library for Python application

Project description

Datapipe

Datapipe is a real-time, incremental ETL library for Python with record-level dependency tracking.

The library is designed for describing data processing pipelines and is capable of tracking dependencies for each record in the pipeline. This ensures that tasks within the pipeline receive only the data that has been modified, thereby improving the overall efficiency of data handling.

https://datapipe.dev/

Development

At the moment these branches are active:

  • master - current development state, will be promoted to 0.13.x series release once ready
  • v0.13 - current stable version
  • v0.11 - legacy stable version (v0.12 was skipped due reasons)

Version compatibility

At the moment, the datapipe library is under active development. Versions: v0.*.*

It should be expected that each minor version is not backward compatible with the previous one. That is, v0.7.0 is not compatible with v0.6.1. Dependencies should be fixed to the exact minor version.

After stabilization and transition to the major version v1.*.*, the common rules will apply: all versions with the same major component are compatible.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

datapipe_core-0.13.10.post1.tar.gz (41.5 kB view hashes)

Uploaded Source

Built Distribution

datapipe_core-0.13.10.post1-py3-none-any.whl (51.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page