AIBearisharXiv – CS AI · 10h ago6/10
🧠
Beyond Continuity: Challenges of Context Switching in Multi-Turn Dialogue with LLMs
Researchers tested how well Large Language Models handle multi-turn conversations with topic shifts, finding that most LLMs struggle to detect when users pivot to new topics and incorrectly carry over irrelevant context from previous exchanges. The study reveals that only advanced reasoning models and strongly instructed LLMs perform accurately, while open-weight models frequently fail even with explicit cues, highlighting a critical robustness gap in production LLM deployments.