Discussion about this post

Charlie Nadeau

Bringing blogs back, I see! Great first piece! Very readable and intriguing, even for someone who doesn't really know much about the AI safety world. You mentioned that the AI becomes either more or less compliant once it realizes something may be a test, and I'm curious how much more variable its responses become at that point.
