Frontier Leaderboards
Legacy Leaderboards
2025 Scale AI. All rights reserved.
EnigmaEval
Puzzle Solving
Last updated: April 3, 2025
Performance Comparison
1
13.09±1.92
1
11.91±1.85
2
9.21±1.65
3
6.81±0.83
4
6.14±1.37
4
o1 (December 2024)
5.65±1.32
4
5.57±1.31
5
4.23±1.17
8
4.14±1.16
8
3.21±1.00
8
3.18±1.02
8
3.04±0.98
8
2.70±0.92
8
2.36±0.87
8
2.26±0.86
9
2.20±0.84
9
2.17±0.84
14
Gemini 2.0 Flash Thinking (January 2025)
1.10±0.60
15
Claude 3.5 Sonnet (October 2024)
0.91±0.55
16
Pixtral Large (November 2024)
0.84±0.53
18
Claude 3 Opus
0.82±0.45
18
GPT-4o (November 2024)
0.80±0.44
18
0.69±0.48
18
0.63±0.45
18
0.58±0.43
18
Llama 3.2 90B Vision Instruct
0.38±0.35