Calibration of OpenAI o3 and o4-mini on Humanity's Last Exam | Scale AI
Scale AI logo
Book a Demo
→
Log In
←
Blog
Research
How Calibrated Are OpenAI’s o3 and o4-mini? A Deep Dive Using Humanity’s Last Exam
by
Ziwen Han
,
Dean Lee
,
Meher Mankikar
,
Edward Gan
and
Summer Yue
Published
April 17, 2025
Reading Time
6 min read
Copy Link
Products
Research
Enterprise
Government
Customers
Resources