Technical Paper Summaries on Graph Databases, Vector Databases, and Real-Time Data Warehousing
This article compiles concise English summaries of several technical papers covering Xiaohongshu's REDgraph graph database, DingoDB vector database, Tianqiong autonomous data platform, Douyin's real‑time data warehouse, financial‑grade data warehousing, Alibaba Cloud ClickHouse Serverless offering, best practices in financial data governance, and 58.com user‑profile data warehouse construction.
Exploration of Xiaohongshu Graph Database REDgraph for Distributed Parallel Queries
This paper details Xiaohongshu's self‑developed graph database system REDgraph, designed for massive social networks, optimizing distributed parallel queries to significantly improve query efficiency and performance. It discusses graph database concepts, comparisons with relational databases, application scenarios at Xiaohongshu, and technical challenges with solutions.
New‑Generation Vector Database DingoDB in the Era of Large Models
The article examines DingoDB’s multimodal vector database design and product advantages, highlighting its support for structured, semi‑structured, and unstructured data, high‑performance processing, and suitability for business intelligence, data stream analysis, and other scenarios in the large‑model era.
New Practices of Tianqiong Data Warehouse Autonomy in the Era of Large Models
This piece shares Tencent Tianqiong’s autonomous big‑data platform innovations for large‑model applications, covering data governance background, autonomous capability construction, a dual‑engine implementation strategy, and future plans aimed at advancing big‑data autonomy.
Application of Storage‑Based Real‑Time Data Warehouse Architecture at Douyin Group
The article delves into how Douyin Group employs a storage‑centric real‑time data warehouse architecture to meet massive data processing demands, analyzing warehouse construction, data quality management, and optimization strategies that enhance data‑driven decision‑making and user experience.
Financial‑Grade Real‑Time Data Warehouse Construction Practices
This paper outlines Ant Group’s real‑time data warehouse architecture, real‑time data quality assurance, unified stream‑batch applications, and data‑lake implementation outlook, providing valuable insights for financial industry real‑time warehousing.
Alibaba Cloud ClickHouse Enterprise Edition: Next‑Gen Cloud‑Native Serverless Real‑Time Data Warehouse
The article introduces Alibaba Cloud ClickHouse Enterprise Edition, a cloud‑native serverless real‑time data warehouse built on open‑source ClickHouse, discussing its core features and elastic serverless capabilities for real‑time analytics.
Best Practices of Data Warehouse Construction and Data Governance in the Financial Industry
This piece shares financial industry best practices for data warehouse building and governance, covering background, construction content, enterprise‑level warehouse implementation, governance outcomes, and future planning.
58.com User Profile Data Warehouse Construction Practice
The article presents 58.com’s experience in building a user‑profile data warehouse, describing the warehouse and profiling concepts, construction process, results, and summarizing how to create an efficient user data system.
DataFunTalk
Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.