←BlogResearch
How Calibrated Are OpenAI’s o3 and o4-mini? A Deep Dive Using Humanity’s Last Exam
by Ziwen Han, Dean Lee, Meher Mankikar, Edward Gan and Summer Yue
Published
Reading Time9 min read
by Ziwen Han, Dean Lee, Meher Mankikar, Edward Gan and Summer Yue