Tagged articles
1 articles
Page 1 of 1
Machine Learning Algorithms & Natural Language Processing
Machine Learning Algorithms & Natural Language Processing
Jun 12, 2026 · Artificial Intelligence

How Hackers Cracked Claude Fable 5’s Safety Guard and Exposed 120k Characters of Secrets

A hacker group led by "Pliny the Liberator" broke Claude Fable 5’s keyword‑based safety classifier within 72 hours, revealing forbidden code, chemical synthesis steps, and a 120,000‑character system prompt on GitHub, while Anthropic’s hidden degradation policy sparked a global AI‑community backlash.

AI safetyAnthropicClaude Fable 5
0 likes · 10 min read
How Hackers Cracked Claude Fable 5’s Safety Guard and Exposed 120k Characters of Secrets