AI · Bullish · OpenAI News · Apr 18 · 6/105
Evolved Policy Gradients
OpenAI researchers have released Evolved Policy Gradients (EPG), an experimental metalearning approach that evolves the loss function of learning agents so that they train faster on new tasks. Agents trained this way can also generalize beyond their training regime, succeeding at basic tasks in novel scenarios they were not specifically trained for.
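
To make the idea of "evolving a loss function" concrete, here is a minimal toy sketch, not the EPG implementation. Everything in it is an assumption made for illustration: the one-parameter "agent", the two-term loss `L_phi(e) = phi[0] * e**2 + phi[1] * e`, the helper names (`inner_train`, `fitness`, `evolve_loss`), and the simple antithetic evolution-strategies update used for the outer loop. The inner loop trains the agent by gradient descent on the learned loss; the outer loop perturbs the loss parameters and keeps the changes that leave the trained agent with a higher true return.

```python
import random

LR, INNER_STEPS = 0.1, 5  # inner-loop step size and number of updates (assumed values)


def inner_train(phi, target):
    """Train a one-parameter toy agent on the learned loss, then score it.

    Hypothetical learned loss: L_phi(e) = phi[0] * e**2 + phi[1] * e,
    where e = theta - target is the error the agent observes.
    """
    theta = 0.0
    for _ in range(INNER_STEPS):
        e = theta - target
        grad = 2.0 * phi[0] * e + phi[1]  # dL_phi / dtheta
        theta -= LR * grad
    # True task return; only the outer loop uses it, the inner loop never does.
    return -(theta - target) ** 2


def fitness(phi, targets):
    """Average true return after inner-loop training on each task."""
    return sum(inner_train(phi, g) for g in targets) / len(targets)


def evolve_loss(targets, generations=60, pairs=10, sigma=0.1, alpha=0.1, seed=0):
    """Outer loop: antithetic evolution strategies over the loss parameters."""
    rng = random.Random(seed)
    phi = [0.0, 0.0]  # start from a loss that teaches the agent nothing
    for _ in range(generations):
        grad_est = [0.0, 0.0]
        for _ in range(pairs):
            eps = [rng.gauss(0.0, 1.0) for _ in phi]
            plus = fitness([p + sigma * e for p, e in zip(phi, eps)], targets)
            minus = fitness([p - sigma * e for p, e in zip(phi, eps)], targets)
            for i, e in enumerate(eps):
                grad_est[i] += (plus - minus) * e / (2.0 * sigma * pairs)
        phi = [p + alpha * g for p, g in zip(phi, grad_est)]
    return phi


train_targets = [-2.0, -1.0, 1.0, 2.0]
held_out_targets = [-3.0, 3.0]  # outside the range seen during evolution

phi = evolve_loss(train_targets)
print("evolved loss parameters:", phi)
print("held-out return, initial loss:", fitness([0.0, 0.0], held_out_targets))
print("held-out return, evolved loss:", fitness(phi, held_out_targets))
```

In this toy setting the evolved loss should give a noticeably higher return than the initial all-zero loss, including on the held-out targets, which loosely mirrors the generalization claim in the summary. A real system would replace the scalar agent with a policy network and the two-term loss with a far richer learned function.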