AI · Neutral · arXiv – CS AI · 6h ago · 6/10
PersonaTeaming: Supporting Persona-Driven Red-Teaming for Generative AI
PersonaTeaming introduces a persona-driven approach to red-teaming generative AI systems, combining automated adversarial prompt generation with human-in-the-loop collaboration. The method outperforms existing automated approaches while enabling security researchers to leverage diverse perspectives and backgrounds to uncover AI model vulnerabilities more effectively.