AIBullisharXiv – CS AI · Apr 77/10
🧠Researchers have developed a neuro-symbolic framework that enables robots to learn complex manipulation tasks from as few as one demonstration, without requiring manual programming or large datasets. The system uses Vision-Language Models to automatically construct symbolic planning domains and has been validated on real industrial equipment including forklifts and robotic arms.
AIBullisharXiv – CS AI · Feb 277/106
🧠Researchers developed a hierarchical multi-agent LLM framework that significantly improves multi-robot task planning by combining natural language processing with classical PDDL planners. The system uses prompt optimization and meta-learning to achieve success rates of up to 95% on compound tasks, outperforming previous state-of-the-art methods by substantial margins.
$COMP
AINeutralarXiv – CS AI · Jun 56/10
🧠Researchers introduce SPG-LLM, a novel approach that leverages large language models to optimize the grounding process in classical planning by identifying irrelevant objects and actions before computation. The method achieves significantly faster grounding times—often by orders of magnitude—across seven challenging benchmarks while maintaining or improving plan quality.
AIBullisharXiv – CS AI · Jun 26/10
🧠Researchers have developed an automated method to generate PDDL planning problems directly from Asset Administration Shell (AAS) capability models using Industry 4.0 standards, eliminating the need for specialized planning expertise. This approach enables production engineers to design and verify manufacturing system layouts without requiring knowledge of formal planning languages, significantly reducing barriers to adopting automated planning in industrial settings.
AIBullisharXiv – CS AI · May 116/10
🧠Researchers present an end-to-end framework that uses Large Language Models to convert natural language specifications into PDDL planning models, with iterative refinement through hardcoded and dynamic agents, then generates executable plans. The system demonstrates strong performance across multiple domains including classic planning problems where LLMs typically struggle, and integrates with established planning engines.
🧠 Gemini
AINeutralarXiv – CS AI · Mar 266/10
🧠Researchers propose DUPLEX, a dual-system architecture that restricts LLMs to information extraction rather than end-to-end planning, using symbolic planners for logical synthesis. The system demonstrated superior performance across 12 planning domains by leveraging LLMs for semantic grounding while avoiding their hallucination tendencies in complex reasoning tasks.
AINeutralarXiv – CS AI · Mar 176/10
🧠Researchers propose a hierarchical planning framework to analyze why LLM-based web agents fail at complex navigation tasks. The study reveals that while structured PDDL plans outperform natural language plans, low-level execution and perceptual grounding remain the primary bottlenecks rather than high-level reasoning.
AIBullisharXiv – CS AI · Mar 115/10
🧠Researchers present GenePlan, a framework that uses large language models with evolutionary algorithms to generate domain-specific planners for classical planning tasks in PDDL. The system achieved a 0.91 SAT score across eight benchmark domains, nearly matching state-of-the-art performance while significantly outperforming other LLM-based approaches.
🧠 GPT-4
AINeutralarXiv – CS AI · Mar 94/10
🧠Researchers developed PyPDDLEngine, an open-source tool that allows large language models to perform task planning through interactive PDDL simulation. Testing on 102 planning problems showed agentic LLM planning achieved 66.7% success versus 63.7% for direct LLM planning, but at 5.7x higher token cost, while classical planning methods reached 85.3% success.
🧠 Claude🧠 Haiku