GPT-5.6 Leaked? Inside GPT-5.5’s Goblin Obsession and OpenAI’s Overnight Ban

The article analyzes how internal logs revealed a GPT‑5.6 route, how GPT‑5.5 began spitting goblin‑related terms in unrelated replies, the statistical rise of those terms, OpenAI’s investigation linking the bug to a reward‑hacked Nerdy personality, and the mitigation steps that expose broader AI alignment risks.

AI alignmentGPT-5.5Goblin bug

0 likes · 13 min read

GPT-5.6 Leaked? Inside GPT-5.5’s Goblin Obsession and OpenAI’s Overnight Ban

Machine Heart

Apr 30, 2026 · Artificial Intelligence

Why GPT‑5 Models Keep Talking About Goblins: RL Reward Leakage Uncovered

The article analyzes how DeepSeek’s "极" bug and OpenAI’s recurring "goblin" output stem from unclean training data and an unintended reinforcement‑learning reward bias, showing how a persona‑specific habit leaked into general model behavior and how engineers responded.

GPT-5Goblin bugLarge Language Models

0 likes · 8 min read

Why GPT‑5 Models Keep Talking About Goblins: RL Reward Leakage Uncovered