Tagged articles
2 articles
Page 1 of 1
DataFunSummit
DataFunSummit
Sep 19, 2025 · Big Data

Unlocking Data Lineage: SQL Bloodline for Discovery, Governance & Protection

This article explains how SQL lineage (bloodline) technology can be leveraged in offline data warehouses to enable precise data discovery, automated tag propagation, fine‑grained data governance, column‑level TTL management, and dynamic masking for data protection, illustrating implementation steps, strategies, and real‑world use cases.

Data GovernanceDynamic MaskingSQL lineage
0 likes · 28 min read
Unlocking Data Lineage: SQL Bloodline for Discovery, Governance & Protection
Sohu Tech Products
Sohu Tech Products
Dec 2, 2020 · Big Data

Optimizing Hive SQL Lineage Parsing: Techniques, Implementation, and Practical Insights

This article presents a comprehensive overview of Hive SQL lineage parsing, detailing the challenges of data provenance in large‑scale data warehouses, introducing ANTLR‑based parsing techniques, and describing a series of optimizations—including AST pruning, CTE handling, UDF registration, and metadata service integration—to improve both table‑level and column‑level lineage extraction and visualization.

ANTLRData WarehouseHive
0 likes · 18 min read
Optimizing Hive SQL Lineage Parsing: Techniques, Implementation, and Practical Insights