Exploring Model Dynamics for Accumulative Poisoning Detection
The paper, a joint effort by Alibaba Mama and HKBU TMLR, shows that monitoring model dynamics—specifically a newly defined memorization‑discrepancy metric—can reveal hidden accumulative poisoning attacks in online advertising streams, and introduces a discrepancy‑aware correction algorithm that consistently outperforms existing defenses across benchmark datasets.