y0news
#lifebench (1 article)
AI · Neutral · arXiv – CS AI · 5h ago

LifeBench: A Benchmark for Long-Horizon Multi-Source Memory

Researchers introduce LifeBench, a new AI benchmark that tests long-term memory systems by requiring integration of both declarative and non-declarative memory across extended timeframes. Current state-of-the-art memory systems achieve only 55.2% accuracy on this challenging benchmark, highlighting significant gaps in AI's ability to handle complex, multi-source memory tasks.