How Calibrated Are OpenAI’s o3 and o4-mini? A Deep Dive Using Humanity’s Last Exam

by Ziwen Han, Dean Lee, Meher Mankikar, Edward Gan and Summer Yue

9 min read