Tagged articles

Data Construction

14 articles · Page 1 of 1
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Jun 24, 2026 · Artificial Intelligence

Why Public QA Datasets Fail for Deep Research Agents—and How to Build Effective Training Data

The article explains that single‑ or two‑hop QA datasets cannot teach Deep Research agents multi‑step reasoning, outlines four mainstream data‑construction methods, describes trajectory sampling with a three‑stage funnel filter, and shares practical guidelines on data volume, difficulty distribution, question types, and common pitfalls.

AI Agent TrainingData ConstructionDeep Research
0 likes · 32 min read
Why Public QA Datasets Fail for Deep Research Agents—and How to Build Effective Training Data
Wu Shixiong's Large Model Academy
Wu Shixiong's Large Model Academy
Apr 20, 2026 · Artificial Intelligence

How to Build Multi‑Step Reasoning Training Data for Deep Research Agents

Standard QA datasets fall short for deep research tasks because they lack the multi‑step, dynamic reasoning required; this article explains why, outlines four data‑construction techniques—SailorFog‑QA, WebFrontier, WebShaper, E2HQA—details trajectory sampling, filtering, scale considerations, and interview‑ready explanations.

AI AgentsData ConstructionLLM training
0 likes · 16 min read
How to Build Multi‑Step Reasoning Training Data for Deep Research Agents
DataFunSummit
DataFunSummit
Sep 18, 2025 · Artificial Intelligence

Boosting LLM Function Call: Data, Training, and Agent Optimization Strategies

This presentation by Yao Yitong of China Telecom AI Research Institute explains why Function Call is essential for LLM deployment, outlines data‑centric and training‑centric optimization methods, discusses common pitfalls and reward‑function design for reinforcement learning, and showcases practical Agent application patterns for real‑world tasks.

AgentData ConstructionLLM
0 likes · 36 min read
Boosting LLM Function Call: Data, Training, and Agent Optimization Strategies
DataFunSummit
DataFunSummit
Jul 3, 2025 · Artificial Intelligence

Boosting LLM Function Call Capabilities: From Data Construction to RLHF Optimization

On July 12, 2025, the DataFun Summit will feature a technical session where China Telecom AI Research Institute engineer Yao Yitong presents a deep dive into enhancing large language model Function Call abilities through systematic data and training optimizations, offering practical insights for AI practitioners.

AIData ConstructionLLM
0 likes · 4 min read
Boosting LLM Function Call Capabilities: From Data Construction to RLHF Optimization
转转QA
转转QA
Aug 12, 2022 · Backend Development

Improving Test Efficiency through Data Construction: Practices and Insights

This article explains how systematic data construction, using a low‑code front‑end and Java back‑end platform, streamlines complex test scenarios, reduces manual effort, and enhances both testing efficiency and code quality across multiple business systems.

Backend DevelopmentData ConstructionEfficiency
0 likes · 9 min read
Improving Test Efficiency through Data Construction: Practices and Insights
转转QA
转转QA
Aug 27, 2021 · Game Development

Improving Game Business Data Construction to Reduce Cost and Increase Efficiency

This article describes the challenges of custom‑heavy game business workflows and manual data‑construction testing, then presents an initial and a refined solution that automates data generation across multiple game titles, reduces coupling, and improves efficiency and cost for backend operations.

Data ConstructionEfficiencyGame Development
0 likes · 5 min read
Improving Game Business Data Construction to Reduce Cost and Increase Efficiency
转转QA
转转QA
May 13, 2020 · Operations

QA Transformation: Applying HTTP DIFF and Visual UI Automation to Operational and Order‑Related Requirements

This article describes how the QA team at ZuanZuan YouPin shifted from traditional functional testing to an assisted model by introducing HTTP DIFF for short‑flow operational features and visual UI automation for dynamic pages, as well as data‑construction and online order inspection techniques for complex order‑related scenarios.

Data ConstructionHTTP DIFFOperations
0 likes · 7 min read
QA Transformation: Applying HTTP DIFF and Visual UI Automation to Operational and Order‑Related Requirements
Efficient Ops
Efficient Ops
Jan 30, 2018 · Operations

Scaling Event Operations for Ten‑Million Online Securities Users

This article details how Ping An Securities built a technology‑first event‑handling team, created new reporting channels, developed a data‑construction platform, and implemented proactive monitoring to efficiently support over ten million internet securities users.

Data ConstructionITSMMonitoring
0 likes · 21 min read
Scaling Event Operations for Ten‑Million Online Securities Users
转转QA
转转QA
Oct 22, 2017 · Backend Development

Evolution and Architecture of a Transaction Service Testing Framework

This article details the evolution of a transaction‑related testing framework, describing its background, objectives, development stages—including all‑in‑one code, method extraction, project separation, data construction, checklist and performance testing—and outlines various test case categories and the lightweight release workflow.

AutomationData Constructionbackend
0 likes · 11 min read
Evolution and Architecture of a Transaction Service Testing Framework