AIBullisharXiv – CS AI · 4d ago7/10
🧠Researchers used large language models and evolutionary search to create the first domain-independent heuristics for symbolic AI planning that surpass hand-engineered baselines. These evolved heuristics, written in C++, solve more planning tasks than existing state-of-the-art approaches and maintain the soundness guarantees of traditional planners.
AIBullisharXiv – CS AI · Apr 147/10
🧠Researchers propose VaCoAl, a hyperdimensional computing architecture that combines sparse distributed memory with Galois-field algebra to address limitations in modern AI systems like catastrophic forgetting and the binding problem. The deterministic system demonstrates emergent properties equivalent to spike-timing-dependent plasticity and achieves multi-hop reasoning across 25.5M paths in knowledge graphs, positioning it as a complementary third paradigm to large language models.
AINeutralarXiv – CS AI · Mar 267/10
🧠Researchers propose a new symbolic-mechanistic approach to evaluate AI models that goes beyond accuracy metrics to detect whether models truly generalize or rely on shortcuts like memorization. Their method combines symbolic rules with mechanistic interpretability to reveal when models exploit patterns rather than learn genuine capabilities, demonstrated through NL-to-SQL tasks where a memorization model achieved 94% accuracy but failed true generalization tests.
AINeutralarXiv – CS AI · Mar 177/10
🧠Researchers propose shifting from large monolithic AI models to domain-specific superintelligence (DSS) societies due to unsustainable energy costs and physical constraints of current generative AI scaling approaches. The alternative involves smaller, specialized models working together through orchestration agents, potentially enabling on-device deployment while maintaining reasoning capabilities.
AIBullisharXiv – CS AI · Mar 97/10
🧠Researchers developed Localized In-Context Learning (L-ICL), a technique that significantly improves large language model performance on symbolic planning tasks by targeting specific constraint violations with minimal corrections. The method achieves 89% valid plan generation compared to 59% for best baselines, representing a major advancement in LLM reasoning capabilities.
AINeutralarXiv – CS AI · Mar 47/104
🧠Researchers have introduced SorryDB, a dynamic benchmark for evaluating AI systems' ability to prove mathematical theorems using the Lean proof assistant. The benchmark draws from 78 real-world formalization projects and addresses limitations of static benchmarks by providing continuously updated tasks that better reflect community needs.
AINeutralarXiv – CS AI · 5d ago5/10
🧠Researchers propose Generalized Holographic Reduced Representations (GHRR), an advancement in Hyperdimensional Computing that improves how complex data structures are encoded through a flexible, non-commutative binding operation. The framework demonstrates enhanced performance when applied to transformer models, suggesting potential efficiency improvements for AI systems that bridge symbolic and connectionist approaches.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers extend bounded fitting—a machine learning paradigm for logical formula discovery—to more expressive description logics beyond ALC, maintaining PAC-style guarantees while implementing practical solutions via SAT solvers. The work demonstrates that this approach scales to complex logical systems with inverse roles and qualified restrictions, achieving competitive results against existing concept learners.
AINeutralarXiv – CS AI · May 116/10
🧠Researchers have developed a parallel lifted planning algorithm using semi-naive Datalog evaluation that significantly accelerates classical AI planning by combining rule-level and grounding-level parallelism. The approach achieves up to 6-fold speedup on 8 cores and solves more planning tasks than existing baselines, particularly on computationally intensive grounding operations.
AINeutralarXiv – CS AI · May 76/10
🧠Researchers introduce NoisyCausal, a benchmark for testing how well large language models handle causal reasoning when presented with noisy, incomplete, or misleading information. The study proposes a modular framework combining LLMs with explicit causal graph structures, demonstrating significant improvements over standard prompting approaches and better generalization across external benchmarks.
AINeutralarXiv – CS AI · Apr 156/10
🧠Researchers propose a semantic bootstrapping framework that transfers knowledge from large language models into interpretable symbolic Tsetlin Machines, enabling text classification systems to achieve BERT-comparable performance while remaining fully transparent and computationally efficient without runtime LLM dependencies.
AINeutralarXiv – CS AI · Apr 146/10
🧠Researchers introduce Object-Oriented World Modeling (OOWM), a framework that structures LLM reasoning for robotic planning by replacing linear text with explicit symbolic representations using UML diagrams and object hierarchies. The approach combines supervised fine-tuning with group relative policy optimization to achieve superior planning performance on embodied tasks, demonstrating that formal software engineering principles can enhance AI reasoning capabilities.
AINeutralarXiv – CS AI · Apr 136/10
🧠Researchers present a novel approach using agentic language model feedback frameworks to generate planning domains from natural language descriptions augmented with symbolic information. The method employs heuristic search over model space optimized by various feedback mechanisms, including landmarks and plan validator outputs, to improve domain quality for practical deployment.
AIBullisharXiv – CS AI · Mar 96/10
🧠Researchers propose Hybrid Hierarchical RL (H²RL), a new framework that combines symbolic logic with deep reinforcement learning to address misalignment issues in AI agents. The method uses logical option-based pretraining to improve long-horizon decision-making and prevent agents from over-exploiting short-term rewards.
AIBullisharXiv – CS AI · Mar 44/102
🧠Researchers propose Symbolic Reward Machines (SRMs) as an improvement over traditional Reward Machines in reinforcement learning, eliminating the need for manual user input while maintaining performance. SRMs process observations directly through symbolic formulas, making them more applicable to widely adopted RL frameworks.