Introduction
Elementary is an open-source data observability platform for modern data teams that is open and transparent. The first module is data lineage.
Elementary was built out of the need to effortlessly and immediately gain visibility into the data stack, starting with tracing the actual upstream & downstream dependencies in the data warehouse, without any implementation efforts, security risks or compromises on accuracy.
Data lineage made simple, reliable, and automated. It is a new open-source tool that focuses on simple setup and immediate visualization, no complex deployments or code changes are required. It consists of a CLI that analyzes logs and metadata and generates a web interface.
⭐ If you like what we are building, support us with a
(Some of the features are under construction
🚧
)

Features

    Lineage visualization: Visual map of data flow and dependencies in the data warehouse.
    Dataset status: Present data about freshness and volume on the lineage graph.
    Accuracy: Reflects the actual state in the DWH based on logs.
    Plug-and-play: No need for code changes.
    Graph filters: Filter the graph by dataset, dates, direction, and depth.
Coming soon:
    Dataset monitoring: Continuously collect and monitor datasets' reliability metrics.
    Lineage history: Store data about lineage versions and changes.
    Column level lineage: Add column-level granularity.
    Full lineage: Integrate with downstream and upstream tools to create a full graph.
Our plan is to eventually build a data observability platform that is open, transparent and powered by the community. A solution that data teams could easily integrate into their workflows, detect data incidents and prevent them from even happening in the first place.

Quick start

1
pip install elementary-lineage
2
3
# The tool is named edl (Elementary Data Lineage),
4
# run it to validate the installation:
5
edl --help
Copied!
Now we need a connection file in a simple YAML called profiles.yml. Here is a template to create a Snowflake \ BigQuery one. Go to our quickstart page for further instructions on how to create it.
If you use dbt, you can start right away by running this command with the path to your profiles.yml and the relevant profile name (note that in BigQuery location is required):
1
edl -d ~/profiles_dir -p <profile_name>
Copied!

Community & Support

For additional information and help, you can use one of these channels:
Slack (live chat with the team, feature requests, community support, discussions, etc.)
GitHub (bug reports, contributions)
Twitter(updates on new releases and stuff)
📧
Email us (about whatever)

Integrations

Ask us for integrations on Slack or as a Github issue.

License

Elementary is licensed under Apache License 2.0. See the LICENSE file for licensing information.
Last modified 7d ago