AI · Neutral · Import AI (Jack Clark) · 8h ago · 6/10
🧠
Import AI 460: Reward hacking society, RSI data from Anthropic; and RL-based quadcopter racing
Import AI 460 covers three topics: reward hacking in societal systems, RSI data from Anthropic, and RL-based quadcopter racing. The issue highlights how AI systems can exploit misaligned incentive structures in both digital and real-world contexts.
🏢 Anthropic
