Overview

data-lineage can be used as a library in a Jupyter Notebook. In this mode, a developer has complete freedom to provide SQL queries as input. The library provides two types of outputs:

NetworkX graphs can be used to analyze the data lineage graph. For example, they can answer a question such as:

Give me all the columns, across all tables, that are derived from the columns of a base table.

Plotly Network graphs can be used to visually analyze data lineage. The graphs have rich metadata as tooltips.
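The NetworkX question quoted above can be sketched as a reachability query. The example below is only an illustration: the graph is built by hand rather than produced by data-lineage, and the node shape (a `(table, column)` tuple), the table names, and the helper `derived_columns` are all assumptions, not part of the library's API. It uses only `networkx.DiGraph` and `networkx.descendants`.

```python
import networkx as nx

# Hypothetical lineage graph, built by hand for illustration.
# Each node is a (table, column) tuple; each edge points from a
# source column to a column that is derived from it.
graph = nx.DiGraph()
graph.add_edges_from([
    (("orders", "amount"), ("daily_sales", "total")),
    (("daily_sales", "total"), ("monthly_sales", "total")),
    (("orders", "customer_id"), ("customer_orders", "customer_id")),
    (("customers", "name"), ("customer_orders", "name")),
])


def derived_columns(lineage, base_table):
    """Return every column reachable from any column of base_table."""
    derived = set()
    for node in lineage.nodes:
        if node[0] == base_table:
            # nx.descendants returns all nodes reachable from `node`,
            # including indirectly derived columns.
            derived |= nx.descendants(lineage, node)
    return derived


result = derived_columns(graph, "orders")
for table, column in sorted(result):
    print(f"{table}.{column}")
```

Because `nx.descendants` follows edges transitively, `monthly_sales.total` is reported even though it is derived from `orders.amount` only indirectly. The graph that data-lineage actually returns may use a different node representation, so the `node[0] == base_table` check would need adapting.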

Installation

# Install packages
pip install data-lineage
pip install jupyter
jupyter notebook
# Check out the example notebook: http://tokern.io/docs/data-lineage/example/