AIBullisharXiv – CS AI · Apr 77/10
🧠MemMachine is an open-source memory system for AI agents that preserves conversational ground truth and achieves superior accuracy-efficiency tradeoffs compared to existing solutions. The system integrates short-term, long-term episodic, and profile memory while using 80% fewer input tokens than comparable systems like Mem0.
🧠 GPT-4🧠 GPT-5
AIBullisharXiv – CS AI · Mar 177/10
🧠Justitia is a new scheduling system for task-parallel LLM agents that optimizes GPU server performance through selective resource allocation based on completion order prediction. The system uses memory-centric cost quantification and virtual-time fair queuing to achieve both efficiency and fairness in LLM serving environments.
🏢 Meta
AIBullishOpenAI News · Mar 117/10
🧠OpenAI has developed an agent runtime that transforms their Responses API from a simple model interface into a full computing environment. The system uses shell tools and hosted containers to enable secure, scalable AI agents that can manage files, execute tools, and maintain state.
🏢 OpenAI
AINeutralarXiv – CS AI · Mar 46/104
🧠Researchers analyzed memory systems in LLM agents and found that retrieval methods are more critical than write strategies for performance. Simple raw chunk storage matched expensive alternatives, suggesting current memory pipelines may discard useful context that retrieval systems cannot compensate for.
AINeutralarXiv – CS AI · May 96/10
🧠Skill1 presents a unified reinforcement learning framework that enables language model agents to co-evolve three coupled capabilities: skill selection, utilization, and distillation from a single task-outcome reward signal. Demonstrated improvements over existing baselines on complex tasks suggest advances in how AI agents can build and leverage persistent skill libraries across diverse problem domains.
AIBullisharXiv – CS AI · Mar 37/109
🧠Researchers introduce HiMAC, a hierarchical reinforcement learning framework that improves LLM agent performance on long-horizon tasks by separating macro-level planning from micro-level execution. The approach demonstrates state-of-the-art results across multiple environments, showing that structured hierarchy is more effective than simply scaling model size for complex agent tasks.
AINeutralHugging Face Blog · Sep 221/107
🧠The article title references Gaia2 and ARE as tools for community-driven agent research, but no article content was provided for analysis. Without the full article body, specific details about these platforms and their implications cannot be determined.