🧠 AI🟢 BullishImportance 7/10

Learning from human preferences

OpenAI News|June 13, 2017 at 07:00 AM|7 views

🤖AI Summary

OpenAI and DeepMind have collaborated to develop an algorithm that can learn human preferences by comparing two proposed behaviors, eliminating the need for humans to manually write goal functions. This approach aims to reduce dangerous AI behavior that can result from oversimplified or incorrect goal specifications.

Key Takeaways

→OpenAI partnered with DeepMind's safety team to develop preference learning algorithms.
→The new approach removes the need for humans to write explicit goal functions for AI systems.
→Simple proxy goals or incorrectly specified complex goals can lead to dangerous AI behavior.
→The algorithm learns by being shown pairs of behaviors and told which is preferred.
→This represents a step toward building safer AI systems through preference learning.