DataFunSummit
Author

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

1.6k
Articles
0
Likes
4.7k
Views
0
Comments
Recent Articles

Latest from DataFunSummit

100 recent articles max
DataFunSummit
DataFunSummit
Apr 4, 2026 · Databases

How Hologres 4.0 Unifies Real‑Time Data Warehousing and AI‑Native Analytics

This article analyzes the architectural evolution of Alibaba Cloud Hologres from a fragmented multi‑engine data stack to the All‑in‑One design of Hologres 4.0, detailing its multimodal search, AI‑native functions, performance benchmarks, resource governance, lake integration, and real‑world deployment scenarios.

AI-native analyticsHologresReal-time Data Warehouse
0 likes · 12 min read
How Hologres 4.0 Unifies Real‑Time Data Warehousing and AI‑Native Analytics
DataFunSummit
DataFunSummit
Apr 1, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article analyzes why Retrieval‑Augmented Generation (RAG) often underperforms in enterprise production, identifies eight common pitfalls—from document parsing to token costs—and offers a systematic roadmap of diagnostics, hybrid search, reranking, and deployment strategies presented by leading AI experts.

AIEnterpriseRAG
0 likes · 18 min read
Why RAG Fails in Production and How to Fix It: Expert Insights
DataFunSummit
DataFunSummit
Mar 31, 2026 · Industry Insights

How SelectDB Overcomes the ‘Impossible Triangle’ in Real‑Time Automotive Data

The whitepaper explains how the explosive growth, multimodal nature, and real‑time collaboration demands of intelligent connected‑vehicle data create two “impossible triangles,” and how SelectDB’s three technical innovations—Index+Bitmap primary keys, Variant sparse columns, and hybrid full‑text/vector search—enable cost‑effective, high‑performance real‑time analytics across five automotive scenarios with proven case studies from leading OEMs.

Database InnovationSelectDBautomotive
0 likes · 17 min read
How SelectDB Overcomes the ‘Impossible Triangle’ in Real‑Time Automotive Data
DataFunSummit
DataFunSummit
Mar 29, 2026 · Artificial Intelligence

How Code Intelligence Is Evolving: From Foundation Models to Repository‑Level Agents

This article reviews the rapid evolution of code intelligence, covering the history of code foundation models, reinforcement‑learning optimizations, scaling‑law insights, the LoopCoder architecture, rigorous multi‑level evaluation suites, and the emergence of repository‑level code agents, while highlighting open‑source contributions such as Qwen‑Coder.

code evaluationcode intelligencereinforcement learning
0 likes · 15 min read
How Code Intelligence Is Evolving: From Foundation Models to Repository‑Level Agents
DataFunSummit
DataFunSummit
Mar 29, 2026 · Artificial Intelligence

How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless

This article explores the evolution of e‑commerce search toward multimodal and cross‑modal capabilities, outlines a generic architecture that combines text and image processing via embedding and vector retrieval, and demonstrates how to implement the solution using Alibaba Cloud's AI Search Open Platform and Elasticsearch Serverless with detailed guidance on models, similarity metrics, quantization, and performance optimization.

AIElasticsearchEmbedding
0 likes · 22 min read
How to Build a Multimodal Product Search Engine with Embedding and Vector Retrieval on Elasticsearch Serverless
DataFunSummit
DataFunSummit
Mar 27, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Delivers Triple‑Digit ROI

The article examines costly data platform failures—such as a $40 million payroll system collapse and a healthcare.gov outage—highlighting why traditional data middle platforms become data swamps, then explains how Palantir’s ontology approach, with its three‑layer semantic, dynamics, and decision architecture, can turn data into actionable insights and achieve triple‑digit ROI.

PalantirROIdata architecture
0 likes · 4 min read
Why Traditional Data Platforms Fail and How Ontology Delivers Triple‑Digit ROI
DataFunSummit
DataFunSummit
Mar 26, 2026 · Industry Insights

Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI

The article analyzes costly data‑platform failures—such as a $40 million school‑district payroll system and a collapsed Healthcare.gov launch—identifies the root cause as ineffective data middle platforms, and explains how Palantir’s ontology‑based three‑layer architecture (semantic, dynamics, decision) transforms raw data into automated business actions, delivering measurable ROI across multiple industries.

Decision AutomationPalantirdata architecture
0 likes · 5 min read
Why Traditional Data Platforms Fail and How Ontology Drives Triple‑Digit ROI
DataFunSummit
DataFunSummit
Mar 25, 2026 · Big Data

How Apache Gravitino and OpenLineage Transform Data Governance for AI‑Driven Enterprises

In the era of AI and multi‑cloud, this article analyzes the core challenges of data governance—data silos, quality gaps, and compliance risks—and explains how Apache Gravitino’s unified metadata architecture together with OpenLineage’s standardized lineage model provide a scalable, automated solution for intelligent, real‑time data management.

Apache GravitinoOpenLineagebig data
0 likes · 15 min read
How Apache Gravitino and OpenLineage Transform Data Governance for AI‑Driven Enterprises
DataFunSummit
DataFunSummit
Mar 24, 2026 · Artificial Intelligence

How to Build a Multimodal Product Search System with Embedding and Vector Retrieval

This article presents a comprehensive, end‑to‑end solution for multimodal product search, detailing the evolution from keyword to image‑based queries, the core embedding and vector retrieval technologies, practical Elasticsearch Serverless integration, quantization methods, and a complete demo workflow for building a high‑performance, low‑cost search platform.

AI search platformElasticsearchEmbedding
0 likes · 21 min read
How to Build a Multimodal Product Search System with Embedding and Vector Retrieval