DataFunSummit
Author

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

1.7k
Articles
0
Likes
6.8k
Views
0
Comments
Recent Articles

Latest from DataFunSummit

100 recent articles max
DataFunSummit
DataFunSummit
May 11, 2026 · Artificial Intelligence

How Lance Powers Enterprise Multimodal AI Data Lakes

The article analyzes why 74% of AI projects fail due to feedback gaps and data silos, explains how the open‑source Lance format addresses these issues with unified multimodal storage, outlines a layered Lance‑on‑Ray architecture, and details three real‑world practices—implicit feedback loops, GPU‑accelerated self‑evolution, and semantic knowledge‑graph evolution—to boost R&D efficiency.

CAGRADaftData Lake
0 likes · 13 min read
How Lance Powers Enterprise Multimodal AI Data Lakes
DataFunSummit
DataFunSummit
May 11, 2026 · Artificial Intelligence

The Memory Pain of AI Agents: Inside MemOS Architecture and 200% Cloud Usage Growth

This article analyses why memory has become the critical bottleneck for AI agents, explains the five‑layer MemOS framework that fuses model‑driven and application‑driven approaches, presents performance gains such as 45‑72% token savings and 30% response‑quality improvement, and showcases real‑world deployments and security mechanisms that turn memory from a pain point into a scalable infrastructure.

AI AgentMemOSOpenClaw
0 likes · 18 min read
The Memory Pain of AI Agents: Inside MemOS Architecture and 200% Cloud Usage Growth
DataFunSummit
DataFunSummit
May 11, 2026 · Artificial Intelligence

Four Hidden Pitfalls of the Hermes AI Agent—and How to Fix Them

The Hermes AI Agent, despite its hype and one‑click deployment, suffers from four critical issues—cognitive gaps after deployment, uncontrolled self‑evolution, limited memory applicability, and finite security rules—each of which DTClaw addresses with professional skill bundles, a deterministic Skill‑Tune engine, pluggable memory architecture, and the CARLI five‑dimensional security model, backed by benchmark improvements.

AI AgentCARLIDTClaw
0 likes · 8 min read
Four Hidden Pitfalls of the Hermes AI Agent—and How to Fix Them
DataFunSummit
DataFunSummit
May 10, 2026 · Artificial Intelligence

Why Memory Is the Bottleneck for AI Agents and How MemOS Overcomes It

The article analyzes the critical role of memory in AI agents, compares model‑driven and application‑driven approaches, details the five‑layer MemOS architecture with three‑level memory coordination, and presents performance gains such as 100‑200% monthly cloud‑service growth, up to 72% token savings, and a 30% improvement in answer quality.

AI AgentEnterprise AILLM
0 likes · 18 min read
Why Memory Is the Bottleneck for AI Agents and How MemOS Overcomes It
DataFunSummit
DataFunSummit
May 10, 2026 · Big Data

How Lance File Format v2.2 Accelerates, Cuts Costs, and Governs Multimodal Data

Lance File Format v2.2 tackles the AI data explosion by delivering hundred‑fold random‑read performance, advanced two‑layer compression, zero‑cost schema evolution, Git‑style versioning, external blob handling, and a roadmap toward native media support and intelligent encoding, positioning it as a core infrastructure for large‑scale multimodal workloads.

Data GovernanceFile FormatIO performance
0 likes · 14 min read
How Lance File Format v2.2 Accelerates, Cuts Costs, and Governs Multimodal Data
DataFunSummit
DataFunSummit
May 9, 2026 · Industry Insights

Why Palantir’s Ontology Beats Traditional Data Middle Platforms in Decision Making

The article examines costly failures of conventional data middle platforms—such as a $40 million payroll system flop and a chemical firm’s data‑cleaning bottleneck—then shows how Palantir’s ontology‑driven approach delivers triple‑digit ROI for BP, 98% R&D efficiency for Novartis, and $14 million annual savings for General Mills, highlighting the three‑layer semantic, dynamics, and decision architecture that turns data into actionable decisions.

Business IntelligenceData PlatformDecision Systems
0 likes · 5 min read
Why Palantir’s Ontology Beats Traditional Data Middle Platforms in Decision Making
DataFunSummit
DataFunSummit
May 9, 2026 · Artificial Intelligence

DeepEye: Building an Autonomous, Human‑Steerable Data Agent System

The article presents DeepEye, an open‑source autonomous data‑agent platform that combines LLM reasoning, workflow orchestration, and human‑in‑the‑loop control to enable end‑to‑end analysis of heterogeneous data, and introduces a six‑level capability taxonomy to guide its evolution from manual to fully autonomous operation.

Data AgentDeepEyeKnowledge Graph
0 likes · 18 min read
DeepEye: Building an Autonomous, Human‑Steerable Data Agent System
DataFunSummit
DataFunSummit
May 8, 2026 · Artificial Intelligence

Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems

This article reviews cutting‑edge AI search and recommendation technologies, covering Alibaba Cloud's Agentic RAG architecture, Huawei Noah's LLM‑enhanced recommendation pipeline, and Baidu's generative ranking model GRAB, while detailing their design challenges, multi‑modal retrieval strategies, performance gains, and real‑world deployment results.

AI searchAgentic RAGgenerative ranking
0 likes · 6 min read
Agent Architecture in Action: Building Next‑Gen Recommendation and Search Systems
DataFunSummit
DataFunSummit
May 7, 2026 · Artificial Intelligence

From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless

This article walks through a complete multimodal product search solution, explaining how embedding and vector retrieval technologies—combined with Elasticsearch Serverless and Alibaba Cloud AI Search—enable image‑based and semantic queries, detailing the architecture, key algorithms, quantization tricks, and practical deployment steps.

AI searchElasticSearchEmbedding
0 likes · 22 min read
From Text to Images: Building Multimodal Product Search with Elasticsearch Serverless
DataFunSummit
DataFunSummit
May 7, 2026 · Artificial Intelligence

How LanceDB Powers Enterprise‑Level Memory in Volcano Engine’s OpenClaw

The article details Volcano Engine’s LAS AI team’s analysis, selection, and deep optimization of the LanceDB vector database as the core memory plugin for the enterprise‑grade OpenClaw (ArkClaw) agent platform, covering comparative evaluation, custom enhancements, and a vision for a cloud‑edge collaborative memory lake.

ArkClawAutoDreamContext Engine
0 likes · 16 min read
How LanceDB Powers Enterprise‑Level Memory in Volcano Engine’s OpenClaw