y0news
#interactive-ai2 articles
2 articles
AINeutralarXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

LifeEval: A Multimodal Benchmark for Assistive AI in Egocentric Daily Life Tasks

Researchers introduce LifeEval, a new multimodal benchmark designed to evaluate how well AI assistants can help humans in real-time daily life tasks from a first-person perspective. The benchmark reveals significant challenges for current AI models in providing timely and adaptive assistance in dynamic environments.

AINeutralarXiv โ€“ CS AI ยท 4h ago2
๐Ÿง 

A Comprehensive Evaluation of LLM Unlearning Robustness under Multi-Turn Interaction

Researchers found that machine unlearning in large language models, which aims to remove specific training data influence, is less effective in interactive settings than previously thought. Knowledge that appears forgotten in static tests can often be recovered through multi-turn conversations and self-correction interactions.