AIBullisharXiv – CS AI · 9h ago6/10
🧠
Personal AI Agent for Camera Roll VQA
Researchers introduce camroll, a dataset and AI agent system designed to answer questions about personal photo libraries by retrieving and analyzing relevant images from users' camera rolls. The camroll-agent uses hierarchical memory and specialized tools to handle long-context visual reasoning across thousands of personalized images, outperforming existing baselines in understanding user-specific visual content.