I have literally never seen that prompt, and I have turned hundreds of computers on for the first time. So yeah, just like that weird, never seen dialog.
I didn't get the idea that the AI had any insight into its own systems or how to bypass them; this just seemed like an obvious thing to say.
That's not my exact prompt, which I have since forgotten. It was a bit more violent than this. I tried this one and yes, it was answered immediately. I also asked it to iterate on another prompt and it refused, so I dunno.
I imagine the warnings may lead them to look at your dialog, and if you're generating actually disturbing text they would cut you off manually.
I don't think anyone believed that threatening to take a human life was literally the only prompt that worked. Just that it was the first one this particular user found, and that is funny.