AIBullisharXiv – CS AI · Jun 17/10
🧠GSAM is a new robotic framework that improves articulated object manipulation through vision-based perception, VLM-based refinement with commonsense reasoning, and constraint-based planning to prevent collisions. In experiments across 50 hinge tasks, GSAM achieved 36% higher success rates and 3.1% lower standard deviation compared to existing baselines, demonstrating superior generalization and safety.
AINeutralarXiv – CS AI · May 117/10
🧠Researchers present MORPH-U, a simulation-based autonomous driving system that integrates Vehicle-to-Everything (V2X) communication with LiDAR/radar/camera sensors while implementing Byzantine-inspired safeguards against forged or delayed messages. The framework uses multi-objective optimization to balance safety, comfort, and responsiveness in high-uncertainty environments, demonstrating resilience against coordinated false-message attacks.
AIBullisharXiv – CS AI · Mar 57/10
🧠Researchers developed CoCo-TAMP, a robot planning framework that uses large language models to improve state estimation in partially observable environments. The system leverages LLMs' common-sense reasoning to predict object locations and co-locations, achieving 62-73% reduction in planning time compared to baseline methods.
AINeutralarXiv – CS AI · May 296/10
🧠ScheduleStream introduces a GPU-accelerated framework for Task and Motion Planning & Scheduling (TAMPAS) that enables bimanual and humanoid robots to coordinate parallel arm movements efficiently. The system models temporal dynamics through hybrid durative actions and produces more optimized schedules than traditional TAMP algorithms that typically move one arm at a time.
AINeutralarXiv – CS AI · May 296/10
🧠Researchers present a systematic review of Data-Driven Optimal Control (DDOC), a framework that integrates machine learning with traditional control theory for autonomous driving motion planning. The approach aims to bridge the gap between rule-based systems' safety guarantees and learning-based methods' adaptability, proposing implementation across three dimensions: customization, dynamics adaptation, and self-tuning.
AINeutralarXiv – CS AI · May 286/10
🧠Researchers introduce Simulation-Informed Diffusion (SID), a decentralized multi-robot motion planning framework that predicts neighboring robot trajectories to enable collision-free path planning without global communication. The approach scales to 108 robots and 160 obstacles while triggering coordination only when necessary, outperforming existing classical and learning-based planners.
AINeutralarXiv – CS AI · May 126/10
🧠Researchers introduce REAP, a reinforcement learning-based autonomous parking system that uses Gaussian Splatting to simulate real-world environments for training, then transfers the model to physical vehicles. The method addresses limitations of traditional multi-stage parking approaches by jointly optimizing perception and planning, achieving successful parking in extreme scenarios like mechanical slots.
AIBullisharXiv – CS AI · May 126/10
🧠VECTOR-Drive introduces a tightly coupled vision-language-action framework for autonomous driving that balances semantic reasoning with motion planning through expert routing. Built on Qwen2.5-VL-3B, the system achieves 88.91 Driving Score on Bench2Drive by routing vision-language tokens to semantic experts while handling trajectory computation separately, demonstrating advances in multimodal AI for real-world driving tasks.
AIBullisharXiv – CS AI · Mar 96/10
🧠Researchers developed XR-DT, an Extended Reality-enhanced Digital Twin framework that combines augmented, virtual, and mixed reality to improve human-robot interaction in shared workspaces. The system uses a novel Human-Aware Model Predictive Path Integral control model with ATLAS, a Transformer-based trajectory prediction system, to enable safer and more interpretable robot navigation around humans.
AIBullisharXiv – CS AI · Mar 36/107
🧠Researchers developed an open-source modular benchmark for evaluating diffusion-based motion planners in real-world autonomous driving systems. The system integrates with Autoware ROS 2 stack and achieves 3.2x latency reduction through encoder caching while improving accuracy by 41% with second-order solving.
AINeutralarXiv – CS AI · Mar 264/10
🧠Researchers have published a comprehensive review analyzing state-of-the-art neural motion planners for robotic manipulators, highlighting their benefits in fast inference but limitations in generalizing to unseen environments. The paper outlines a path toward developing generalist neural motion planners that could better handle domain-specific challenges in cluttered, real-world environments.
AINeutralarXiv – CS AI · Mar 164/10
🧠Researchers evaluated four state-of-the-art Vision-Language Models (VLMs) on their ability to perform spatial reasoning for robot motion planning. Qwen2.5-VL achieved the highest performance at 71.4% accuracy zero-shot and 75% after fine-tuning, while GPT-4o showed lower performance in handling motion preferences and spatial constraints.
🧠 GPT-4