A CLI to work with DataHub metadata
Project description
Introduction to Metadata Ingestion
:::tip Find Integration Source Please see our Integrations page to browse our ingestion sources and filter on their features. :::
Integration Methods
DataHub offers three methods for data ingestion:
- UI Ingestion : Easily configure and execute a metadata ingestion pipeline through the UI.
- CLI Ingestion guide : Configure the ingestion pipeline using YAML and execute by it through CLI.
- SDK-based ingestion : Use Python Emitter or Java emitter to programmatically control the ingestion pipelines.
Types of Integration
Integration can be divided into two concepts based on the method:
Push-based Integration
Push-based integrations allow you to emit metadata directly from your data systems when metadata changes. Examples of push-based integrations include Airflow, Spark, Great Expectations and Protobuf Schemas. This allows you to get low-latency metadata integration from the "active" agents in your data ecosystem.
Pull-based Integration
Pull-based integrations allow you to "crawl" or "ingest" metadata from the data systems by connecting to them and extracting metadata in a batch or incremental-batch manner. Examples of pull-based integrations include BigQuery, Snowflake, Looker, Tableau and many others.
Core Concepts
The following are the core concepts related to ingestion:
- Sources: Data systems from which extract metadata. (e.g. BigQuery, MySQL)
- Sinks: Destination for metadata (e.g. File, DataHub)
- Recipe: The main configuration for ingestion in the form or .yaml file
For more advanced guides, please refer to the following:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file acryl_datahub-1.4.0.6.tar.gz.
File metadata
- Download URL: acryl_datahub-1.4.0.6.tar.gz
- Upload date:
- Size: 2.7 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2d6bdc5994681f2a99732cee70aaa058038b3aa0a5ab34274de1afc61f62935b
|
|
| MD5 |
e0f41550a20250b233aeaccf1f9f5077
|
|
| BLAKE2b-256 |
9cc9b60fd17b0e739d825eb6d51947104f5542228cc7d547029fc6308c2971b8
|
File details
Details for the file acryl_datahub-1.4.0.6-py3-none-any.whl.
File metadata
- Download URL: acryl_datahub-1.4.0.6-py3-none-any.whl
- Upload date:
- Size: 3.3 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b5c50d3223b65daaedf3a64a063de6097dcfe1bfa6a5b260052df22acfaf3cdc
|
|
| MD5 |
c8458752b5dc8bb4475d5aa83ab18625
|
|
| BLAKE2b-256 |
e9d91eff0a71934c4267707f0a5ab811f656309513e7876baf29d9fc25d45835
|