<- Back to leaderboard
Spanish
Model
Score
95% Confidence
1st
Gemini Pro Flash 2
1149
+37 / -32
2nd
Gemini 1.5 Pro (November 2024)
1121
+36 / -33
3rd
o1 Preview
1120
+29 / -26
4
GPT-4o (May 2024)
1101
+25 / -27
5
Gemini 1.5 Pro (August 27, 2024)
1091
+26 / -25
6
Gemini 1.5 Pro (May 2024)
1088
+27 / -25
7
GPT-4o (August 2024)
1076
+27 / -25
8
GPT-4 (November 2024)
1070
+32 / -32
9
GPT-4 Turbo Preview
1042
+22 / -21
10
Mistral Large 2
1041
+26 / -25
11
Gemini 1.5 Pro (April 2024)
1027
+29 / -32
12
Aya Expanse 32B
1012
+31 / -35
13
Claude 3.5 Sonnet (June 2024)
1011
+25 / -25
14
Gemini 1.5 Flash
999
+26 / -26
15
Gemma 2 27B
972
+25 / -24
16
Llama 3.2 90B Vision Instruct
962
+26 / -27
17
Llama 3.3 70B Instruct
943
+34 / -37
18
Claude 3 Opus
939
+21 / -23
19
Llama 3.1 405B Instruct
935
+22 / -25
20
Llama 3 70B Instruct
902
+28 / -28
21
Claude 3 Sonnet
866
+28 / -30
22
Mistral Large
865
+29 / -29
23
Gemini 1.0 Pro
865
+28 / -29
24
Aya 23 35B*
814
+30 / -29