Tag

data catalog


Data Thinking Notes
Aug 13, 2024 · Fundamentals

How to Define, Classify, and Catalog Your Enterprise Data Assets

This article explains what data assets are and how to categorize them by structure and source, outlines a six‑step inventory process, describes a hierarchical catalog architecture, and highlights the four key benefits of a unified data asset directory for modern enterprises.

data assets · data catalog · data governance
11 min read
Data Thinking Notes
Jul 9, 2024 · Big Data

How to Build a Robust Enterprise Data Asset Catalog for Better Governance

This article explains why a comprehensive data asset catalog is essential for modern enterprises, outlines its core components (inventory, metadata, data lineage, standards, and access control), details step‑by‑step construction methods, and highlights key applications in governance, quality, compliance, architecture, and valuation.

Big Data · data catalog · data governance
13 min read
Data Thinking Notes
Nov 19, 2023 · Fundamentals

How to Build an Effective Data Asset Management Framework for Enterprises

This article explains why enterprises need a data asset framework, outlines its key components such as catalog management, policy support, and development trends, and provides a step‑by‑step guide with visual diagrams for constructing and operating a comprehensive data asset management system.

data asset management · data catalog · data governance
5 min read
Architects Research Society
Aug 2, 2023 · Fundamentals

Data Fabric Architecture: Three Patterns, Core Technical Components, and Inherent Limitations

The article explains data fabric architecture as a promising approach for enabling data exchange across distributed systems, outlines its three design patterns, describes key technical components such as data virtualization, data catalog, and knowledge graphs, and discusses the trade‑offs, costs, and limitations that organizations must consider.

Data Fabric · Data Virtualization · Knowledge Graph
17 min read
Data Thinking Notes
Jul 26, 2023 · Big Data

How to Build an Effective Data Asset Catalog for Enterprise Data Governance

This article explains what data assets are, why a data asset catalog is essential for data governance, and provides a step‑by‑step framework—including identification criteria, value dimensions, construction phases, tool support, and core functional modules—to help enterprises systematically create, manage, and leverage a data asset catalog.

data asset · data catalog · data governance
16 min read
DataFunSummit
Apr 23, 2023 · Fundamentals

Data Governance Practices and Implementation Path at Dipu Technology

This article presents Dipu Technology's comprehensive data governance methodology, covering construction paths, a typical enterprise digital platform framework, core governance components, practical case studies, and a Q&A session that together illustrate how businesses can design, implement, and sustain effective data governance across the organization.

Enterprise Architecture · data catalog · data governance
19 min read
ByteDance Data Platform
Jun 8, 2022 · Backend Development

How ByteDance Optimized Data Catalog Performance with Apache Atlas and JanusGraph

This article details ByteDance's 2021 overhaul of its Data Catalog system, the performance regressions encountered after switching to Apache Atlas, and the step‑by‑step backend optimizations—including JanusGraph tuning, Gremlin query refactoring, parallel processing, and write‑path improvements—that reduced latency from minutes to seconds.

Apache Atlas · Backend · JanusGraph
12 min read
Architect
May 25, 2022 · Big Data

Metadata Infrastructure and Governance in Bilibili's Data Platform

The article details how Bilibili built a unified metadata infrastructure—including a URN‑based model, collection pipelines, quality assurance, storage in TiDB/ES/HugeGraph, and query services—to support data discovery, lineage, impact analysis, and governance across its growing data platform.

Big Data · ETL · data catalog
21 min read
ByteDance Data Platform
Apr 27, 2022 · Big Data

How ByteDance Built a Scalable Data Catalog: Key Technologies and Future Plans

This article details ByteDance's Data Catalog system, covering its unified metadata model, standardized ingestion connectors, search optimization techniques, lineage capabilities, and storage layer enhancements. It highlights the key technical designs, the performance improvements achieved, and future work to advance data governance and asset utilization.

Big Data · data catalog · data lineage
12 min read
Snowball Engineer Team
Sep 24, 2019 · Big Data

Snowball Data Middle Platform (AIBO): Architecture, Capabilities, and Future Outlook

The article introduces Snowball's AIBO data middle platform, details its storage‑compute separation architecture and core capabilities (data integration, catalog, tagging, analysis tools, and micro‑service data APIs), and outlines future enhancements for security, lineage, and continuous business‑driven iteration.

Big Data · data analysis · data catalog
12 min read