arXiv (CS AI) · 6h ago
Integrating LTL Constraints into PPO for Safe Reinforcement Learning
Researchers developed PPO-LTL, a new framework that integrates Linear Temporal Logic safety constraints into Proximal Policy Optimization for safer reinforcement learning. The system uses Büchi automata to monitor safety violations and converts them into penalty signals, showing reduced safety violations while maintaining competitive performance in robotics environments.
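
To make the monitor-to-penalty idea concrete, here is a minimal Python sketch. It is not the paper's implementation: it replaces a full Büchi automaton with a two-state monitor for a single safety formula, G(red_light -> not moving), which is enough to detect a bad prefix of a safety property. The names `SafetyMonitor`, `shape_rewards`, `red_light`, `moving`, and `penalty_weight` are all hypothetical, and the fixed penalty weight is an assumption, since the summary does not say how PPO-LTL weights its penalties.

```python
from typing import FrozenSet, List, Sequence, Tuple

Label = FrozenSet[str]  # atomic propositions that hold at one time step


class SafetyMonitor:
    """Hypothetical two-state monitor for G(red_light -> not moving).

    'ok' is the initial state; 'bad' is a sink reached on the first violation.
    """

    def __init__(self) -> None:
        self.state = "ok"

    def step(self, label: Label) -> bool:
        """Advance on one label; return True only on the step that first violates."""
        if self.state == "bad":
            return False  # already violated; do not penalize again
        if "red_light" in label and "moving" in label:
            self.state = "bad"
            return True
        return False


def shape_rewards(
    rewards: Sequence[float],
    labels: Sequence[Label],
    penalty_weight: float = 1.0,  # assumed fixed weight, not taken from the source
) -> Tuple[List[float], str]:
    """Subtract a penalty from the reward at the step where the monitor trips."""
    monitor = SafetyMonitor()
    shaped = []
    for reward, label in zip(rewards, labels):
        violated = monitor.step(label)
        shaped.append(reward - penalty_weight * float(violated))
    return shaped, monitor.state


# One short episode: the agent moves through a red light at step 1.
episode_labels = [
    frozenset({"moving"}),
    frozenset({"red_light", "moving"}),
    frozenset({"red_light"}),
]
shaped_rewards, final_state = shape_rewards([1.0, 1.0, 1.0], episode_labels, 2.0)
```

The shaped rewards would then stand in for the raw environment rewards when computing advantages for the PPO update. Penalizing only the first violation is one design choice among several; penalizing every violating step, or resetting the monitor after each violation, are equally plausible variants.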