Baidu Geek Talk
Apr 16, 2025 · Industry Insights

What Do the Latest AIIA FactTesting Benchmarks Reveal About China’s Large Language Models?

The Q1 2025 FactTesting benchmark results were released at the AIIA’s 14th plenary meeting in Nanjing. The evaluation covered more than 200 large models and highlighted Baidu’s Wenxin 4.5 and Wenxin X1 as leaders in basic and reasoning capabilities. The release also outlined an expanded roadmap for multimodal and agent testing for the rest of the year.

AI benchmark · China AI · FactTesting