Machine Heart
Jun 13, 2026 · Artificial Intelligence
Why PEFT Evaluation Must Go Beyond Downstream Scores: Quantifying General Capability Loss
The PEFT‑Arena benchmark reframes parameter‑efficient fine‑tuning evaluation as a stability‑plasticity trade‑off, measuring both downstream task gains and the preservation of pretrained general abilities through dual‑axis metrics, weight‑space and activation‑space analyses, and path‑wise diagnostics.
Activation GeometryInterpolation AnalysisModel Forgetting
0 likes · 12 min read
