AINeutralarXiv – CS AI · Feb 276/106
🧠
TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation
Researchers introduce TherapyProbe, a methodology to identify relational safety failures in mental health chatbots through adversarial simulation. The study reveals dangerous interaction patterns like 'validation spirals' and creates a Safety Pattern Library with 23 failure archetypes and design recommendations.