Scale Logo
SEAL Logo

Humanity's Last Exam

Challenging LLMs at the frontier of human knowledge

Last updated: April 11, 2025

Performance Comparison

1

18.16 ±1.51

2

8.12 ±1.07

2

8.04 ±1.07

2

7.96 ±1.06

2

6.56 ±0.97

2

5.68 ±0.91

5

5.44 ±0.89

5

5.40 ±0.89

6

4.60 ±0.82

7

Nova Pro

4.40 ±0.80

7

4.08 ±0.78

8

Nova Lite

3.64 ±0.73

10

2.72 ±0.64