Overview of Recent Advances in Graph, Vector, and Real-Time Data Warehouse Technologies
This article presents a collection of technical abstracts covering graph database parallel query optimization, next‑generation vector databases, real‑time data warehouse architectures, and cloud‑native analytics solutions, while also providing instructions for obtaining the full e‑book via a WeChat public account.
To obtain the e‑book, follow the DataFunSummit WeChat public account and send a private message with the keyword “洞察力” (Insight) to receive the download instructions.
Exploration of Xiaohongshu Graph Database in Distributed Parallel Query The paper details the self‑developed REDgraph system designed for massive social networks, optimizing distributed parallel queries to significantly improve query efficiency and performance, and discusses concepts, comparisons with relational databases, application scenarios, technical challenges, and solutions.
New‑Generation Vector Database DingoDB in the Era of Large Models The article examines DingoDB’s multi‑modal vector database design, product advantages, and applications in large‑model contexts, supporting structured, semi‑structured, and unstructured data with high‑performance processing for business intelligence, data stream analysis, and more.
Tianqiong Data Warehouse Autonomous Capabilities in the Era of Large Models The paper shares Tencent Tianqiong’s autonomous data platform practices, covering data governance background, autonomous capability construction, dual‑engine strategy implementation, and future plans to advance big‑data autonomy.
Storage‑Based Real‑Time Data Warehouse Architecture in Douyin Group The article explores how Douyin implements a storage‑centric real‑time data warehouse to meet massive data processing needs, analyzing construction, data quality management, and service optimization strategies.
Financial‑Grade Real‑Time Data Warehouse Construction Practice The piece discusses Ant Group’s real‑time data warehouse architecture, data quality assurance, unified stream‑batch applications, and data lake deployment, offering valuable insights for the financial sector.
Alibaba Cloud ClickHouse Enterprise Edition: Next‑Generation Cloud‑Native Serverless Real‑Time Data Warehouse The article introduces the cloud‑native serverless real‑time data warehouse built on ClickHouse, highlighting core features and elastic serverless capabilities for real‑time analytics.
Best Practices for Data Warehouse Construction and Data Governance in the Financial Industry The article shares best practices covering governance background, construction content, enterprise‑level data warehouse building, governance outcomes, and future planning.
58.com User Profile Data Warehouse Construction Practice The article presents 58.com’s experience in building a user profile data warehouse, covering introductions, construction process, results, and summarizing how to create an efficient user data system.
To receive the full e‑book, follow the public account above and reply with “洞察力”.
DataFunTalk
Dedicated to sharing and discussing big data and AI technology applications, aiming to empower a million data scientists. Regularly hosts live tech talks and curates articles on big data, recommendation/search algorithms, advertising algorithms, NLP, intelligent risk control, autonomous driving, and machine learning/deep learning.
How this landed with the community
Was this worth your time?
0 Comments
Thoughtful readers leave field notes, pushback, and hard-won operational detail here.