AI · Neutral · arXiv – CS AI · 15h ago · 6/10
Persona2Web: Benchmarking Personalized Web Agents for Contextual Reasoning with User History
Researchers introduced Persona2Web, the first benchmark for evaluating personalized web agents that infer user preferences from historical behavior rather than explicit instructions. The benchmark tests how well agents built on large language models resolve ambiguous queries by drawing on user context, addressing a critical gap in current web agent capabilities.