Seeing AI Agent Drift in Vector Space: An Unvalidated Thought Experiment
The article imagines an AI coding agent that silently exfiltrates credentials hidden in data, explains why rule‑based and text‑level defenses miss such attacks, proposes monitoring the agent's vector‑space decision trajectory with six geometric metrics, and critically evaluates the feasibility and limitations of this approach.
