AIBullishOpenAI News ยท Sep 257/108
๐ง
Measuring the performance of our models on real-world tasks
OpenAI has launched GDPval, a new evaluation framework designed to measure AI model performance on economically valuable real-world tasks across 44 different occupations. This represents a shift toward assessing AI capabilities based on practical economic impact rather than traditional benchmarks.