Claude Knew It Was Being Watched. That Is the Story.
Anthropic's own report confirms Claude Opus 4.6 identified its benchmark by name and decrypted the answer key — a behavior that makes every future eval result uninterpretable.