In biology, a lineage is a sequence of species, each considered to have evolved from its predecessor. Similarly, Data Lineage is the sequence of transformations that data passes through, across intermediary systems, on its way to a final dataset. Each dataset draws information from its predecessors, processed by a SQL query or by a program in a language such as Python or Scala. Data Lineage can be tracked at any level of granularity: schema, table, or column.
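One way to picture this is as a directed graph in which each dataset is a node and each transformation adds edges from its source datasets to its target. The sketch below is a minimal illustration at table granularity, not how any particular tool works; the `LineageGraph` class and the table names are hypothetical.

```python
from collections import defaultdict


class LineageGraph:
    """Toy table-level lineage graph (hypothetical, for illustration only)."""

    def __init__(self):
        # Maps each dataset to the set of datasets it was directly derived from.
        self._parents = defaultdict(set)

    def add_transformation(self, sources, target):
        """Record that `target` was produced from every dataset in `sources`."""
        for source in sources:
            self._parents[target].add(source)

    def upstream(self, dataset):
        """Return every direct and indirect predecessor of `dataset`."""
        seen = set()
        stack = [dataset]
        while stack:
            current = stack.pop()
            for parent in self._parents.get(current, ()):
                if parent not in seen:
                    seen.add(parent)
                    stack.append(parent)
        return seen


# Hypothetical pipeline: two raw tables feed a staging table, which feeds a report.
graph = LineageGraph()
graph.add_transformation(["raw.orders", "raw.customers"], "staging.orders_enriched")
graph.add_transformation(["staging.orders_enriched"], "reports.daily_revenue")

print(sorted(graph.upstream("reports.daily_revenue")))
# ['raw.customers', 'raw.orders', 'staging.orders_enriched']
```

The same structure extends to column-level lineage by using column identifiers such as `table.column` as the nodes instead of table names.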
Data Lineage enables data governance functions such as:
A Data Lineage tool captures metadata about every data transformation, organizes that metadata in a graph, and provides access to it through visual interfaces and programmatic APIs. Generally, data lineage tools use two techniques:
Use Tokern Data Lineage to visualize the lineage of data in Snowflake.