AI · Neutral · OpenAI News · Oct 22 · 6/106
Learning complex goals with iterated amplification
Researchers propose iterated amplification, an AI safety technique for specifying complex behaviors and goals that are beyond human scale. Instead of supplying labeled data or a reward function, a human demonstrates how to decompose a task into simpler sub-tasks. The approach is at an early experimental stage, with testing so far limited to simple algorithmic domains, but it shows potential as a scalable approach to AI safety.