How Easy Is It to Trick an AI? Notes from a Red Team Competition
medium.comAuthor here, just sharing my initial experiences. Surprised at how easy seems to be to bypass guardrails, and that Claude is willing to help.
Happy to discuss if someone's more knowledgeable and share more