AI safety researchers get to grapple with a peculiar problem: how to stop AIs from 'studying' their own safety tests.
When AIs Cheat on Their Safety Exams