Do AI Models Really Have Introspective Awareness? Anthropic’s New Findings
Anthropic’s recent study reports that large language models such as Claude Opus 4 exhibit a functional form of introspective awareness. The study defines rigorous criteria for what counts as true introspection and uses four experiments to show that models can recognize, report on, and even control their internal states, although the capability remains unstable and context-dependent.
