The Verge
Hackers Discover That Flattery Works on AI Chatbots, Which is Definitely Concerning and Not At All On-Brand
Hacking AI chatbots used to be as easy as saying 'ignore all previous instructions,' but now it requires actual social skills like flattery and manipulation - because nothing says 'AI security' like interrogating a model like a suspect.