Tagged articles
2 articles
Page 1 of 1
ByteDance Data Platform
ByteDance Data Platform
Jan 15, 2026 · Artificial Intelligence

Why Model Evaluation Can Be Cool: Innovative Automated Testing for Data‑Driven LLM Agents

In the era of rapidly advancing large‑model technology, the article outlines the challenges of evaluating data‑centric LLM agents, proposes a three‑layer evaluation framework covering basic capabilities, component‑level checks, and end‑to‑end business impact, and shares practical innovations such as semantic‑equivalence SQL matching, agent‑as‑judge pipelines, and a unified assessment platform.

Agent as judgeAutomated TestingBig Data
0 likes · 22 min read
Why Model Evaluation Can Be Cool: Innovative Automated Testing for Data‑Driven LLM Agents
DataFunTalk
DataFunTalk
Dec 28, 2020 · Artificial Intelligence

Intelligent Question Answering: Scenarios, Architecture, and Technical Implementations (QA, Knowledge‑Graph QA, NL2SQL)

This article introduces the typical applications of intelligent question answering, compares chat‑type, knowledge‑type and task‑type bots, and then details the end‑to‑end architecture, knowledge‑base construction, semantic‑equivalence modeling with BERT‑BIMPM, knowledge‑graph QA pipelines, and NL2SQL techniques, concluding with practical deployment insights.

BERTDialogue SystemsNL2SQL
0 likes · 15 min read
Intelligent Question Answering: Scenarios, Architecture, and Technical Implementations (QA, Knowledge‑Graph QA, NL2SQL)