AIBullishOpenAI News · Dec 166/106
🧠
Evaluating AI’s ability to perform scientific research tasks
OpenAI has launched FrontierScience, a new benchmark designed to test AI systems' reasoning capabilities across physics, chemistry, and biology. The benchmark aims to measure AI progress toward conducting actual scientific research tasks.