OOpen Source SQL Parsers
Parsing SQL queries provides superpowers for monitoring data health. This post describes how to get started on parsing SQL for data observability. Query history of a data warehouse is a rich source of…
Data lineage includes the data origin, what happens to it and where it moves over time. Data lineage gives visibility while greatly simplifying the ability to trace errors back to the root cause in a data analytics process.
Parsing SQL queries provides superpowers for monitoring data health. This post describes how to get started on parsing SQL for data observability. Query history of a data warehouse is a rich source of…
Open source Python project data-lineage now supports column level data lineage. Column lineage enables fine-grained data governance projects for all stake holders. Data Stewards can verify…
This blog will describe how to generate data lineage using the data-lineage python package from query history in Snowflake. data-lineage generates DAG from parsing SQL statements in query history…
Today we released an open source Python project data-lineage to visualize and analyze data lineage. The project was developed in collaboration with data teams on data governance initiatives over the…
What is meant by data lineage ? In Biology, lineage is a sequence of species each of which is considered to have evolved from its predecessor. Similarly, Data Lineage is a sequence of transformations…
Data Lineage tracks data transformation through all systems. It is important for data governance and security. In Data warehouses and data lakes, a team of data engineers maintain a canonical set of…