Advances in Robust AI: Defending Adversarial Attacks, Boosting Domain Generalization, Stopping LLM Jailbreaks
This article reviews the latest progress in designing algorithms with strong robustness, covering adversarial examples in computer vision, novel training paradigms and certification methods, domain‑generalization techniques that achieve state‑of‑the‑art performance in medical imaging and molecular recognition, and new attack‑defense strategies for LLM jailbreak scenarios.
