AINeutralarXiv โ CS AI ยท Feb 276/106
๐ง
TherapyProbe: Generating Design Knowledge for Relational Safety in Mental Health Chatbots Through Adversarial Simulation
Researchers introduce TherapyProbe, a methodology to identify relational safety failures in mental health chatbots through adversarial simulation. The study reveals dangerous interaction patterns like 'validation spirals' and creates a Safety Pattern Library with 23 failure archetypes and design recommendations.