<- Back to leaderboard
Coding
Model
Score
95% Confidence
1st
o1-mini
1265
+40 / -32
2nd
o1-preview
1195
+32 / -32
3rd
Claude 3.5 Sonnet (June 2024)
1115
+24 / -24
4
GPT-4o (August 2024)
1086
+28 / -31
5
GPT-4o (May 2024)
1076
+26 / -26
6
GPT-4 Turbo Preview
1074
+22 / -23
7
Gemini 1.5 Pro (August 27, 2024)
1073
+29 / -29
8
Mistral Large 2
1072
+28 / -27
9
Llama 3.1 405B Instruct
1062
+25 / -25
10
Gemini 1.5 Pro (May 2024)
1022
+27 / -24
11
Llama 3.2 90B Vision Instruct
1020
+30 / -34
12
Claude 3 Opus
995
+22 / -23
13
Gemini 1.5 Flash
972
+27 / -25
14
Gemini 1.5 Pro (April 2024)
931
+27 / -30
15
Claude 3 Sonnet
916
+27 / -29
16
Llama 3 70B Instruct
912
+24 / -25
17
Mistral Large
852
+28 / -28
18
Gemini 1.0 Pro
726
+33 / -33
19
CodeLlama 34B Instruct
636
+37 / -39