Tagged articles
6 articles
Page 1 of 1
Architect's Tech Stack
Architect's Tech Stack
Apr 27, 2026 · Artificial Intelligence

Can Your RAG System Pass the Demo and Remain Accurate Across 5,000 Documents?

The article dissects a tough interview question about building a production‑grade Retrieval‑Augmented Generation (RAG) system that not only works in a demo but also delivers stable, correct answers over a knowledge base of 5,000 documents, covering chunking, hybrid retrieval, intent routing, constrained generation, evaluation metrics, and operational safeguards.

Evaluation MetricsHybrid RetrievalIntent Routing
0 likes · 15 min read
Can Your RAG System Pass the Demo and Remain Accurate Across 5,000 Documents?
DataFunTalk
DataFunTalk
Oct 22, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how Xiaomi’s Xiao AI assistant leverages large language models for intent routing, domain‑specific intent understanding, and response generation, detailing the system architecture, challenges such as knowledge requirements and latency constraints, and the shift from prompt engineering to model fine‑tuning.

AI AssistantIntent RoutingPrompt engineering
0 likes · 5 min read
How Large Language Models Power Xiaomi’s Xiao AI Assistant
DataFunTalk
DataFunTalk
Oct 10, 2025 · Artificial Intelligence

How Large Language Models Power Xiaomi’s Xiao AI Assistant

This article explains how large language models are integrated into Xiaomi’s Xiao AI assistant, covering intent distribution, domain‑specific intent understanding, response generation, architectural design, challenges such as knowledge requirements and latency, and the shift from prompt engineering to model fine‑tuning.

AI AssistantIntent RoutingPrompt engineering
0 likes · 5 min read
How Large Language Models Power Xiaomi’s Xiao AI Assistant
DataFunSummit
DataFunSummit
Oct 5, 2025 · Artificial Intelligence

How Xiaomi’s XiaoAI Harnesses Large Models for Intent Routing and Response Generation

This article explains how Xiaomi’s XiaoAI assistant integrates large language models for intent distribution, vertical intent understanding, and response generation, detailing the architecture, challenges such as knowledge requirements and sub‑200 ms latency, and the shift from prompt engineering to model fine‑tuning that boosted user retention by 10% and query satisfaction by 8%.

AI AssistantIntent RoutingPrompt engineering
0 likes · 4 min read
How Xiaomi’s XiaoAI Harnesses Large Models for Intent Routing and Response Generation
DataFunSummit
DataFunSummit
Sep 29, 2025 · Artificial Intelligence

How Large Language Models Power XiaoAI: From Intent Routing to Response Generation

This article explores how large language models are integrated into Xiaomi’s XiaoAI assistant, detailing the system’s architecture, intent distribution, domain-specific understanding, and response generation, while sharing practical challenges, prompt engineering solutions, and fine‑tuning strategies that boosted user retention and query satisfaction.

AI assistantsIntent RoutingXiaoAI
0 likes · 4 min read
How Large Language Models Power XiaoAI: From Intent Routing to Response Generation
DataFunSummit
DataFunSummit
Aug 27, 2024 · Artificial Intelligence

Applying Large Models to Xiao AI Assistant: Intent Routing, Understanding, and Response Generation

This article presents a comprehensive technical overview of how large language models are integrated into Xiaomi's Xiao AI assistant, detailing the architecture for intent routing, domain‑specific intent understanding, function‑calling mechanisms, fine‑tuning strategies, performance gains, and future research directions.

AI AssistantFunction CallingIntent Routing
0 likes · 14 min read
Applying Large Models to Xiao AI Assistant: Intent Routing, Understanding, and Response Generation