Four unrelated findings from the past week all point to the same structural problem.
The Evidence
OpenAI's goblin post-mortem. GPT-5.1 developed a preference for goblin-themed outputs during reinforcement learning, where a "Nerdy" reward condition was evaluated by a previous model version. The model-as-judge favored goblins. Those outputs got reused in SFT and preference data. The style…