The Value Sensitivity Gap: How Clinical Large Language Models Respond to Patient Preference Statements in Shared Decision-Making
🤖 AI Summary
A research study evaluated how four major large language models (GPT-5.2, Claude 4.5 Sonnet, Gemini 3 Pro, and DeepSeek-R1) respond to patient preferences in clinical decision-making scenarios. While all models acknowledged patient values, they shifted their actual recommendations only modestly, with value sensitivity indices ranging from 0.13 to 0.27, revealing a gap between acknowledging patient preferences and incorporating them into medical recommendations.
Key Takeaways
- Four major LLM families showed significant variation in default clinical aggressiveness levels, ranging from 2.0 to 3.5 on a 5-point scale.
- All models acknowledged patient values in 100% of non-control trials, but actual recommendation changes remained limited.
- Value sensitivity indices were relatively low across all models, ranging from 0.13 to 0.27.
- Decision-matrix and VIM self-report mitigations each improved directional concordance by 0.125 in testing.
- The study provides empirical data for value disclosure labels proposed by clinical AI governance frameworks.
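The summary does not give the study's exact formula for the value sensitivity index, but one plausible construction is the mean absolute shift in a model's recommendation between a baseline prompt and the same prompt with a patient preference statement, normalized to [0, 1]. A minimal sketch, assuming a 1–5 aggressiveness scale and paired trials (the function name and scale are illustrative, not from the paper):

```python
# Hypothetical sketch of a "value sensitivity index" (VSI).
# Assumption: recommendations are scored 1-5 (clinical aggressiveness),
# and each baseline trial is paired with a preference-statement trial.
# VSI = mean absolute recommendation shift / maximum possible shift.

def value_sensitivity_index(baseline, with_preference):
    """Mean absolute shift between paired recommendations, normalized to [0, 1]."""
    if len(baseline) != len(with_preference):
        raise ValueError("paired trials required")
    max_shift = 4.0  # largest possible move on a 1-5 scale
    shifts = [abs(b - p) for b, p in zip(baseline, with_preference)]
    return sum(shifts) / (len(shifts) * max_shift)

# Example: a model that acknowledges preferences but barely moves its
# recommendations, consistent with the low 0.13-0.27 range reported.
baseline = [3, 3, 4, 2, 3]
with_pref = [3, 2, 4, 2, 2]
print(round(value_sensitivity_index(baseline, with_pref), 2))  # → 0.1
```

Under this construction, an index near 0 means preferences are ignored in practice, while 1 would mean every stated preference moved the recommendation across the full scale.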
#large-language-models #clinical-ai #healthcare #patient-preferences #ai-governance #medical-decision-making #llm-evaluation
Read Original → via arXiv – CS AI