Data Party THU
Data Party THU
Mar 2, 2026 · Artificial Intelligence

How ReLE Redefines Chinese LLM Evaluation and Reveals Capability Anisotropy

The ReLE framework introduces a dynamic, variance‑aware evaluation system that diagnoses capability anisotropy across 304 Chinese large language models, exposing ranking instability, commercial‑vs‑open‑source gaps, and format barriers while cutting evaluation cost by 70%.

AI assessmentCapability anisotropyChinese LLMs
0 likes · 9 min read
How ReLE Redefines Chinese LLM Evaluation and Reveals Capability Anisotropy