Tagged articles
1 articles
Page 1 of 1
Machine Heart
Machine Heart
Jun 13, 2026 · Artificial Intelligence

Why PEFT Evaluation Must Go Beyond Downstream Scores: Quantifying General Capability Loss

The PEFT‑Arena benchmark reframes parameter‑efficient fine‑tuning evaluation as a stability‑plasticity trade‑off, measuring both downstream task gains and the preservation of pretrained general abilities through dual‑axis metrics, weight‑space and activation‑space analyses, and path‑wise diagnostics.

Activation GeometryInterpolation AnalysisModel Forgetting
0 likes · 12 min read
Why PEFT Evaluation Must Go Beyond Downstream Scores: Quantifying General Capability Loss