AI · Bearish · arXiv – CS AI · 14h ago · 7/10
Inform, Coach, Relate, Listen: Auditing LLM Caregiving Support Roles
Researchers audited how the safety profiles of large language models shift when the models are deployed in different caregiving support roles, testing GPT-4o-mini, Llama-3.1-8B, and MedGemma on 5,000 real dementia-care queries. The study found that directive, information-focused roles increase interactional risks even though they are perceived as more helpful, revealing a quality-safety tradeoff that challenges current LLM safety evaluation practices.
🧠 GPT-4 · 🧠 Llama