AIBullishOpenAI News ยท Jan 277/107
๐ง
Aligning language models to follow instructions
OpenAI has developed InstructGPT models that significantly improve upon GPT-3's ability to follow user instructions while being more truthful and less toxic. These models use human feedback training and alignment research techniques, and have been deployed as the default language models on OpenAI's API.