y0news

#constraint-learning News & Analysis

7 articles tagged with #constraint-learning. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

AI · Bullish · arXiv – CS AI · May 12 · 7/10

NEXUS: Continual Learning of Symbolic Constraints for Safe and Robust Embodied Planning

Researchers introduce NEXUS, a framework enabling embodied AI agents to learn symbolic constraints for safer decision-making in physical environments. The system addresses the gap between probabilistic language models and the deterministic safety requirements of robotics by decoupling physical feasibility from safety specifications, achieving improved task success while refusing unsafe instructions.

AI · Bullish · arXiv – CS AI · Apr 14 · 7/10

Learning and Enforcing Context-Sensitive Control for LLMs

Researchers introduce a framework that automatically learns context-sensitive constraints from LLM interactions, eliminating the need for manual specification while ensuring perfect constraint adherence during generation. The method enables even 1B-parameter models to outperform larger models and state-of-the-art reasoning systems in constraint-compliant generation.

AI · Bullish · arXiv – CS AI · 4d ago · 6/10

Learning the Error Patterns of Language Models

Researchers propose Palla, an algorithm that learns symbolic constraint functions called prefix filters to capture and correct systematic error patterns in large language models. By analyzing domain-specific failures (e.g., using Python syntax in TypeScript code), Palla enables constrained sampling to significantly improve compilation rates and output validity without retraining models.

🧠 Llama
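
The idea of sampling under a prefix filter can be illustrated with a minimal sketch. Everything here is an assumption made for illustration: the filter `no_python_def` is hand-written rather than learned, `toy_propose` stands in for a language model's candidate tokens, and none of it is Palla's actual algorithm or API. The sketch only shows how a function over output prefixes can veto continuations during generation.

```python
import random
from typing import Callable, Sequence

# Hypothetical prefix filter type: returns True while the text generated so
# far is still acceptable. Hand-written here; the summary describes learned ones.
PrefixFilter = Callable[[str], bool]


def no_python_def(prefix: str) -> bool:
    """Reject TypeScript output in which any line starts with Python's `def `."""
    return not any(line.lstrip().startswith("def ") for line in prefix.splitlines())


def constrained_sample(
    propose: Callable[[str], Sequence[str]],
    prefix_filter: PrefixFilter,
    max_steps: int,
    rng: random.Random,
) -> str:
    """Grow the output token by token, sampling only filter-approved tokens."""
    text = ""
    for _ in range(max_steps):
        allowed = [tok for tok in propose(text) if prefix_filter(text + tok)]
        if not allowed:  # no candidates left, or every candidate was vetoed
            break
        text += rng.choice(allowed)
    return text


def toy_propose(text: str) -> Sequence[str]:
    """Stand-in for a model: proposes candidate next tokens (assumed)."""
    if text == "":
        return ["def ", "function "]  # "def " is the systematic error
    if text.endswith("function ") or text.endswith("def "):
        return ["add"]
    if text.endswith("add"):
        return ["(a: number, b: number)"]
    return []


result = constrained_sample(toy_propose, no_python_def, max_steps=5, rng=random.Random(0))
print(result)
```

Because the filter runs on every candidate prefix, the erroneous `def ` token is never sampled, and no model weights are touched.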
AI · Neutral · arXiv – CS AI · 5d ago · 5/10

Managing Uncertainty in LLM-Generated Procedural Knowledge for Virtual Laboratory Planning

Researchers present a framework for managing uncertainty in language model-generated laboratory procedures for virtual educational environments. The system uses structured domain representations and LLM outputs to extract, validate, and repair procedural steps, addressing common LLM failures like missing actions, incorrect sequencing, and logical incompatibilities.

AI · Neutral · arXiv – CS AI · 5d ago · 6/10

Auditing and Fixing Economic Validity in Tabular Foundation Models for Discrete Choice

Researchers propose a two-stage adapter that constrains tabular foundation model predictions to be consistent with economic theory, ensuring price-demand relationships remain logically consistent while recovering accuracy gains over standard choice models. The approach improves accuracy by up to 13 percentage points on transportation datasets while guaranteeing economic validity, which raw foundation models fail to provide.
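
One way to picture the auditing step is a check that predicted choice probability never rises when an option's own price rises. This is a minimal sketch under that assumption; the summary does not specify which validity conditions the paper audits, and the function name `price_monotonicity_violations` and both toy models are hypothetical.

```python
import math
from typing import Callable, List, Sequence, Tuple


def price_monotonicity_violations(
    choice_prob: Callable[[float], float],
    prices: Sequence[float],
    tol: float = 1e-9,
) -> List[Tuple[float, float]]:
    """Return consecutive price pairs (low, high) where the predicted
    probability of choosing the option increases with its own price."""
    ordered = sorted(prices)
    probs = [choice_prob(p) for p in ordered]
    violations = []
    for i in range(len(ordered) - 1):
        if probs[i + 1] > probs[i] + tol:
            violations.append((ordered[i], ordered[i + 1]))
    return violations


def logit_model(price: float) -> float:
    """Toy model that respects theory: a logit curve decreasing in price."""
    return 1.0 / (1.0 + math.exp(price - 5.0))


# Toy "raw" model with a bump: probability rises between prices 2 and 3.
raw_table = {1.0: 0.8, 2.0: 0.6, 3.0: 0.7, 4.0: 0.3}

grid = [1.0, 2.0, 3.0, 4.0]
valid_report = price_monotonicity_violations(logit_model, grid)
raw_report = price_monotonicity_violations(raw_table.__getitem__, grid)
print(valid_report, raw_report)
```

An audit like this only detects violations on the sampled price grid; guaranteeing validity everywhere, as the summary claims for the adapter, requires constraining the model itself.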

AI · Neutral · arXiv – CS AI · May 11 · 6/10

Direct Reasoning Optimization: Token-Level Reasoning Reflectivity Meets Rubric Gates for Unverifiable Tasks

Researchers propose Direct Reasoning Optimization (DRO), a constrained reinforcement learning framework that improves LLM training on unverifiable tasks by combining token-level reasoning rewards with rubric-based feasibility gates. The approach demonstrates faster, more sample-efficient learning across scientific, medical, legal, and financial domains.

AI · Neutral · arXiv – CS AI · Mar 27/1013

Learning to maintain safety through expert demonstrations in settings with unknown constraints: A Q-learning perspective

Researchers propose SafeQIL, a new Q-learning algorithm that learns safe policies from expert demonstrations in constrained environments where safety constraints are unknown. The approach balances maximizing task rewards with maintaining safety by learning from demonstrated trajectories that complete tasks without violating hidden constraints.