Data Party THU
Data Party THU
Mar 2, 2026 · Artificial Intelligence

How ReLE Redefines Chinese LLM Evaluation and Reveals Capability Anisotropy

The ReLE framework introduces a dynamic, variance‑aware evaluation system that diagnoses capability anisotropy across 304 Chinese large language models, exposing ranking instability, commercial‑vs‑open‑source gaps, and format barriers while cutting evaluation cost by 70%.

AI assessmentCapability anisotropyChinese LLMs
0 likes · 9 min read
How ReLE Redefines Chinese LLM Evaluation and Reveals Capability Anisotropy
Data Party THU
Data Party THU
Aug 22, 2025 · Artificial Intelligence

Why Leading Medical LLMs Falter in Dynamic Red‑Team Tests – The DAS Framework

A new study reveals that large language models which excel on static medical exams dramatically lose accuracy when subjected to the Dynamic, Automatic, Systematic (DAS) red‑team framework, exposing serious weaknesses in robustness, privacy, bias, and hallucination, and urging the adoption of continuous adversarial evaluation for trustworthy clinical AI.

BiasDynamic TestingLLM Red-Teaming
0 likes · 10 min read
Why Leading Medical LLMs Falter in Dynamic Red‑Team Tests – The DAS Framework
360 Tech Engineering
360 Tech Engineering
Mar 21, 2019 · Fundamentals

Memory Leak Bugs in a 6000‑Line C Module: Lessons from Unit Testing

This article shares practical experiences from unit testing a 6000‑line C module, detailing static code review and dynamic gtest testing, uncovering common defects such as memory leaks, and presenting two memory‑leak case studies with root‑cause analysis and cleanup recommendations.

C programmingDynamic TestingMemory Leak
0 likes · 7 min read
Memory Leak Bugs in a 6000‑Line C Module: Lessons from Unit Testing