AINeutralarXiv – CS AI · May 96/10
🧠
PersonaTeaming: Supporting Persona-Driven Red-Teaming for Generative AI
PersonaTeaming introduces a persona-driven approach to red-teaming generative AI systems, combining automated adversarial prompt generation with human-in-the-loop collaboration. The method outperforms existing automated approaches while enabling security researchers to leverage diverse perspectives and backgrounds to uncover AI model vulnerabilities more effectively.