DataFunSummit
Author

DataFunSummit

Official account of the DataFun community, dedicated to sharing big data and AI industry summit news and speaker talks, with regular downloadable resource packs.

1.6k
Articles
0
Likes
4.7k
Views
0
Comments
Recent Articles

Latest from DataFunSummit

100 recent articles max
DataFunSummit
DataFunSummit
Mar 10, 2026 · Artificial Intelligence

How Agent Lightning Redefines AI Agent Learning with Optimizer‑Agent Decoupling

The article explores the paradigm shift toward AI agents in 2025, detailing the open‑source Agent Lightning project’s architecture, non‑intrusive experience capture, programmable pipelines, and experimental results that demonstrate its ability to enable reinforcement learning for any agent with minimal code changes.

Agent LightningOpen‑source Frameworkmachine learning
0 likes · 20 min read
How Agent Lightning Redefines AI Agent Learning with Optimizer‑Agent Decoupling
DataFunSummit
DataFunSummit
Mar 9, 2026 · Artificial Intelligence

How SkillNet Turns Agentic Skills into Reusable Knowledge for Smarter AI

SkillNet introduces a large‑scale, structured skill knowledge base that lets AI agents capture, share, and reuse procedural abilities, dramatically improving benchmark performance and paving the way for more reliable, evolvable intelligent systems.

AI agentsKnowledge GraphPython SDK
0 likes · 13 min read
How SkillNet Turns Agentic Skills into Reusable Knowledge for Smarter AI
DataFunSummit
DataFunSummit
Mar 3, 2026 · Backend Development

How Ant Group Supercharged AI Data Pipelines with Ray: Boosting Index Build Speed and Reliability

This article details Ant Group's use of the Ray distributed computing framework to accelerate massive data indexing, migrate a C++ engine to Ray, implement elastic resource scheduling, improve long‑tail task efficiency, and build a robust RAG operator system with comprehensive governance, achieving up to 2× speed gains and 99.9% success rates.

Distributed ComputingRayai data pipeline
0 likes · 15 min read
How Ant Group Supercharged AI Data Pipelines with Ray: Boosting Index Build Speed and Reliability
DataFunSummit
DataFunSummit
Mar 2, 2026 · Artificial Intelligence

How Data-Juicer Powers Multi‑Modal Data Processing for Large Language Models

This article explains the evolution of Data‑Juicer from a pure‑text preprocessing tool to a full‑stack multi‑modal data engine, detailing its architecture, operator library, Ray‑based distributed execution, performance benchmarks, integration with AI agents, and roadmap for future AI‑centric data workflows.

Data-JuicerMulti-ModalRay
0 likes · 31 min read
How Data-Juicer Powers Multi‑Modal Data Processing for Large Language Models
DataFunSummit
DataFunSummit
Mar 1, 2026 · Big Data

How Ant Group’s Flex Engine Supercharges Flink with Vectorization

This article details Ant Group’s Flex vectorized engine built on Velox, covering the current state of vectorization, Flex’s architecture (Flink + Velox), core feature development, correctness guarantees, large‑scale deployment results, and future directions for full‑link vectorization and broader hardware support.

FlexFlinkVectorization
0 likes · 18 min read
How Ant Group’s Flex Engine Supercharges Flink with Vectorization
DataFunSummit
DataFunSummit
Feb 27, 2026 · Artificial Intelligence

How Large Language Models Are Revolutionizing Ad Recommendation and Solving Cold‑Start Problems

This article explains how advertising recommendation is evolving from traditional feature‑engineered models to LLM‑driven pipelines, detailing data‑infrastructure challenges, semantic upgrades with multimodal embeddings, case studies in short‑video ads, user cold‑start prompt engineering, and future directions for generative recommendation systems.

Ad TechLLMRecommendation Systems
0 likes · 12 min read
How Large Language Models Are Revolutionizing Ad Recommendation and Solving Cold‑Start Problems
DataFunSummit
DataFunSummit
Feb 26, 2026 · Artificial Intelligence

How Alibaba Cloud’s Aivis Redefines AI‑Powered Service Agents with Multi‑Agent Architecture

This article systematically explains the evolution of Alibaba Cloud’s intelligent service platform, focusing on the Aivis digital employee, its three‑layer Planner‑Reasoner‑Executor architecture, context‑engineering optimizations, multi‑agent workflow, and practical recommendations for building enterprise‑grade AI‑driven customer service solutions.

ArchitectureContext EngineeringDigital Employee
0 likes · 24 min read
How Alibaba Cloud’s Aivis Redefines AI‑Powered Service Agents with Multi‑Agent Architecture
DataFunSummit
DataFunSummit
Feb 25, 2026 · Artificial Intelligence

Why RAG Fails in Production and How to Fix It: Expert Insights

This article summarizes a DataFun‑hosted roundtable where leading AI experts dissect the gap between RAG’s promise and real‑world deployment, exposing low recall, hallucinations, and cost overruns, then present systematic diagnostics, evaluation metrics, hybrid search, and engineering best practices to reliably operationalize RAG in enterprise settings.

Enterprise AIHybrid SearchLLM
0 likes · 18 min read
Why RAG Fails in Production and How to Fix It: Expert Insights