y0news
AnalyticsDigestsSourcesTopicsRSSAICrypto

#compositional-ai News & Analysis

4 articles tagged with #compositional-ai. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

4 articles
AIBullisharXiv – CS AI · May 127/10
🧠

Flame3D: Zero-shot Compositional Reasoning of 3D Scenes with Agentic Language Models

Flame3D introduces a training-free framework that enables large language models to reason about 3D scenes compositionally without requiring 3D-specific training data. The system represents scenes as editable visual-textual memories and allows agents to synthesize custom spatial programs at inference time, achieving competitive results on existing benchmarks while opening new possibilities for multi-hop spatial reasoning.

AIBullisharXiv – CS AI · Mar 36/104
🧠

TTOM: Test-Time Optimization and Memorization for Compositional Video Generation

Researchers introduce TTOM (Test-Time Optimization and Memorization), a training-free framework that improves compositional video generation in Video Foundation Models during inference. The system uses layout-attention optimization and parametric memory to better align text prompts with generated video outputs, showing strong transferability across different scenarios.

AIBullisharXiv – CS AI · Mar 27/1011
🧠

Foundation World Models for Agents that Learn, Verify, and Adapt Reliably Beyond Static Environments

Researchers propose a new framework for foundation world models that enables autonomous agents to learn, verify, and adapt reliably in dynamic environments. The approach combines reinforcement learning with formal verification and adaptive abstraction to create agents that can synthesize verifiable programs and maintain correctness while adapting to novel conditions.

AINeutralarXiv – CS AI · Mar 34/106
🧠

CMI-RewardBench: Evaluating Music Reward Models with Compositional Multimodal Instruction

Researchers introduce CMI-RewardBench, a comprehensive evaluation framework for music generation AI models that can process multimodal inputs including text, lyrics, and audio. The system includes a 110k sample preference dataset and reward models that show strong correlation with human judgments for music quality assessment.