AntData
Author

AntData

Ant Data leverages Ant Group's leading technological innovation in big data, databases, and multimedia, with years of industry practice. Through long-term technology planning and continuous innovation, we strive to build world-class data technology and products.

24
Articles
0
Likes
75
Views
0
Comments
Recent Articles

Latest from AntData

24 recent articles
AntData
AntData
Apr 17, 2026 · Industry Insights

5 Silver Rules That Made Dataphin‑MCP’s AI Platform Scale to 1M Calls in 9 Days

This article shares the practical lessons learned from building Dataphin‑MCP, an AI‑enabled data‑development platform, by outlining five concrete "silver" rules, illustrating each with real‑world cases, and discussing deeper considerations for building robust AI‑first tools and harnesses.

AI PlatformAgent designConcept modeling
0 likes · 13 min read
5 Silver Rules That Made Dataphin‑MCP’s AI Platform Scale to 1M Calls in 9 Days
AntData
AntData
Dec 3, 2025 · Artificial Intelligence

How to Build and Refine Your Personal AI Agent Assistant

This article walks through turning a generic AI model into a personal assistant by explaining user‑centric workflows, crafting effective natural‑language prompts, adding clarification steps, validating AI‑generated results through multiple methods, and handling errors with product interactions to create a reliable, evolving assistant.

ChatBILLMresult validation
0 likes · 10 min read
How to Build and Refine Your Personal AI Agent Assistant
AntData
AntData
Jul 8, 2025 · Artificial Intelligence

How RaBitQ Achieves 32× Vector Compression Without Sacrificing Accuracy

This article explains the challenges of high‑dimensional vector retrieval, introduces quantization techniques—especially the binary RaBitQ method and its MRQ extension—detailing their compression ratios, speed gains, compatibility with search indexes, and real‑world performance results in the VSAG system.

AI embeddingsMRQRaBitQ
0 likes · 15 min read
How RaBitQ Achieves 32× Vector Compression Without Sacrificing Accuracy
AntData
AntData
May 30, 2025 · Artificial Intelligence

DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution

The article presents an in‑depth overview of DeepInsight Copilot, an AI‑driven business intelligence product that streamlines data, information, insight, and decision‑recommendation stages, detailing its functional modules, intelligent agents, multi‑generation technical evolution, architecture, model fine‑tuning, and future challenges and solutions in data analysis.

AICopilotData Analysis
0 likes · 21 min read
DeepInsight Copilot: AI‑Powered Data Analysis Platform Overview and Technical Evolution
AntData
AntData
May 20, 2025 · Artificial Intelligence

How Vector Retrieval Powers AI: Challenges, Solutions, and VSAG’s Open‑Source Breakthrough

The article examines the rapid growth of unstructured data, explains the fundamentals and resource‑intensive nature of vector retrieval, presents Ant Group’s engineering practices—including hybrid HNSW‑DiskANN indexing, performance tricks like BSA pruning and memory prefetching, sparse‑vector and feedback‑driven recall improvements—and outlines the open‑source VSAG roadmap and ecosystem integrations.

AI infrastructurePerformance optimizationhybrid indexing
0 likes · 18 min read
How Vector Retrieval Powers AI: Challenges, Solutions, and VSAG’s Open‑Source Breakthrough
AntData
AntData
Apr 3, 2025 · Artificial Intelligence

Ray Flow Insight: Visualizing and Debugging Distributed AI Applications

Ray Flow Insight is an Ant Group open‑source tool that visualizes Ray's distributed programming primitives—Actors, Tasks, and Objects—to turn complex reinforcement‑learning systems from opaque "black boxes" into transparent, debuggable workflows, providing logical, physical, distributed stack, and flame‑graph views for performance analysis and optimization.

AIRayRay Flow Insight
0 likes · 32 min read
Ray Flow Insight: Visualizing and Debugging Distributed AI Applications
AntData
AntData
Mar 20, 2025 · Big Data

Design and Optimization of Real‑time Data Lake Tables with Paimon and Flink for Advertising Diagnostics

This article presents a comprehensive exploration of using Apache Paimon and Flink to design lake tables that support minute‑level latency, low cost, and unified batch‑stream processing for advertising data, covering schema design, partitioning strategies, performance trade‑offs, cost analysis, and operational best practices.

FlinkPaimonadvertising analytics
0 likes · 34 min read
Design and Optimization of Real‑time Data Lake Tables with Paimon and Flink for Advertising Diagnostics
AntData
AntData
Mar 14, 2025 · Fundamentals

Analysis of DeepSeek 3FS Storage Service Architecture and Design

This article provides an in‑depth technical analysis of DeepSeek's open‑source 3FS distributed file system, focusing on the StorageService architecture, space pooling, allocation mechanisms, reference counting, fragmentation handling, and the RDMA‑based read/write data path.

RDMAallocationchunk management
0 likes · 15 min read
Analysis of DeepSeek 3FS Storage Service Architecture and Design
AntData
AntData
Mar 7, 2025 · Artificial Intelligence

Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training

This article examines the challenges of AI storage for massive models, describes Ant Group's multi‑cloud, high‑availability PCache architecture, and details its GPU‑mixed deployment, metadata services, data‑link optimizations, and performance results that enable petabyte‑scale training with low cost and high stability.

AI storagePCachePerformance optimization
0 likes · 19 min read
Design and Implementation of a Cloud‑Native AI Storage Acceleration System (PCache) for Large‑Scale Model Training
AntData
AntData
Mar 5, 2025 · Cloud Native

DeepSeek 3FS Network Communication Module: Design, Implementation, and Impact on AI Infrastructure

This article provides an in‑depth analysis of DeepSeek's open‑source 3FS distributed storage system, focusing on its network communication module, RDMA‑based design, core classes such as IBSocket, Listener, and IOWorker, and how these innovations advance high‑performance AI infrastructure.

AI infrastructureFolly CoroutinesHigh Performance
0 likes · 15 min read
DeepSeek 3FS Network Communication Module: Design, Implementation, and Impact on AI Infrastructure