y0news
AnalyticsDigestsSourcesRSSAICrypto
#general-ai2 articles
2 articles
AIBullisharXiv โ€“ CS AI ยท Feb 277/107
๐Ÿง 

General Agent Evaluation

Researchers have developed Exgentic, a new framework for evaluating general-purpose AI agents that can perform tasks across different environments without domain-specific tuning. The study benchmarked five prominent agent implementations and found that general agents can achieve performance comparable to specialized agents, establishing the first Open General Agent Leaderboard.

AIBullishGoogle DeepMind Blog ยท Dec 47/106
๐Ÿง 

Genie 2: A large-scale foundation world model

Genie 2 is introduced as a large-scale foundation world model designed to generate unlimited diverse training environments. This development aims to support the creation and training of future general AI agents by providing varied simulation scenarios.