BlogResearch

How Calibrated Are OpenAI’s o3 and o4-mini? A Deep Dive Using Humanity’s Last Exam

by Ziwen Han, Dean Lee, Meher Mankikar, Edward Gan and Summer Yue

How Calibrated Are OpenAI’s o3 and o4-mini? A Deep Dive Using Humanity’s Last Exam
Published
Reading Time9 min read