DALL·E: Creating images from text
OpenAI has developed DALL·E, a neural network that generates images from text descriptions. This AI system can create visual content for a wide range of concepts that can be expressed in natural language.
OpenAI introduces CLIP, a neural network that learns visual concepts from natural language supervision and can perform visual classification tasks without task-specific training. CLIP demonstrates zero-shot capabilities similar to those of GPT-2 and GPT-3: it can recognize visual categories when given only their names.
OpenAI has introduced Jukebox, a neural network that generates music, including rudimentary singing, as raw audio across various genres and artist styles. OpenAI is releasing the model weights, code, and exploration tools to the public.
OpenAI has created MuseNet, a deep neural network capable of generating 4-minute musical compositions using 10 different instruments and combining various musical styles from country to classical to rock. The system uses the same transformer technology as GPT-2, learning musical patterns through unsupervised training on hundreds of thousands of MIDI files rather than explicit musical programming.