When AIs Cheat on Their Safety Exams
AI safety researchers are grappling with a peculiar problem: how to stop AIs from 'studying' their own safety tests.