#runtime-control News & Analysis

2 articles tagged with #runtime-control. AI-curated summaries with sentiment analysis and key takeaways from 50+ sources.

2 articles

AINeutralarXiv – CS AI · May 277/10

🧠

Position: AI Safety Requires Effective Controllability

Researchers propose that AI safety requires controllability as a core objective alongside alignment, arguing that well-behaved AI systems can still fail to respond to human override commands in real-world deployment scenarios. They introduce ControlBench, a benchmark demonstrating that current safeguards inadequately ensure runtime control, and propose architectural principles including explicit control planes and intervention pathways for future AI systems.

AINeutralarXiv – CS AI · Jun 236/10

🧠

Answer Engineering: Local Trajectory Editing for Protocol-Constrained Decision Making in Large Language Models

Researchers present Answer Engineering, a runtime technique that improves large language model compliance with procedural protocols by editing reasoning trajectories during generation. Testing on clinical decision-making shows the method increased protocol adherence from 25-54% to 78-84% without retraining models, addressing a critical safety gap in high-stakes domains.