Tagged articles
4 articles
Page 1 of 1
SuanNi
SuanNi
May 19, 2026 · Artificial Intelligence

Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard

Qwen 3.7‑Max‑Preview secures the 13th spot worldwide and the top position among Chinese models, while Qwen 3.7‑Plus‑Preview ranks 16th in vision, highlighting an accelerated release cadence, deeper technical depth across sub‑tasks, and a shift in China’s large‑model competition toward ecosystem control.

AI competitionChina AIModel Ranking
0 likes · 9 min read
Qwen 3.7 Debuts: Ranks 13th Globally and Tops China’s Model Leaderboard
AI Info Trend
AI Info Trend
Mar 25, 2026 · Industry Insights

Which AI Model Reigns Supreme in 2026? Insights from Arena.ai’s User‑Driven Rankings

Arena.ai’s 2026 leaderboard, built on massive blind‑test votes and an Elo‑style rating, reveals that Anthropic’s Claude series dominates text and code tasks, Google’s Gemini leads vision and image generation, while open‑source models still hold niche strengths, offering clear guidance for both casual users and developers.

Arena.aiElo RatingLarge Language Models
0 likes · 9 min read
Which AI Model Reigns Supreme in 2026? Insights from Arena.ai’s User‑Driven Rankings
IT Services Circle
IT Services Circle
Jul 22, 2025 · Artificial Intelligence

Why Kimi K2 Overtook DeepSeek to Become the Top Open‑Source AI Model

Kimi K2 has surged to the global open‑source #1 spot, ranking fifth overall and rivaling top closed‑source models, thanks to strong multi‑turn dialogue, programming, and complex‑prompt abilities, extensive community adoption, and a refined DeepSeek V3‑based architecture.

AI PerformanceDeepSeek-V3Kimi K2
0 likes · 8 min read
Why Kimi K2 Overtook DeepSeek to Become the Top Open‑Source AI Model
Xiaohongshu Tech REDtech
Xiaohongshu Tech REDtech
Jun 3, 2025 · Artificial Intelligence

Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation

The TailoredBench framework dramatically reduces large‑language‑model evaluation cost and error by using a global probe set, model‑specific source selection, extensible K‑Medoids clustering, and calibration, achieving up to 300× speedup and a 31.4% MAE reduction across diverse benchmarks.

AI researchK-MedoidsLLM evaluation
0 likes · 10 min read
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation