Spanish
| Rank | Model                            | Score | 95% Confidence |
|------|----------------------------------|-------|----------------|
| 1    | Gemini 1.5 Pro (November 2024)   | 1146  | +38 / -38      |
| 2    | o1-preview                       | 1121  | +28 / -26      |
| 3    | GPT-4o (May 2024)                | 1107  | +27 / -23      |
| 4    | Gemini 1.5 Pro (August 27, 2024) | 1105  | +28 / -26      |
| 5    | Gemini 1.5 Pro (May 2024)        | 1090  | +25 / -26      |
| 6    | GPT-4 (November 2024)            | 1084  | +33 / -32      |
| 7    | GPT-4o (August 2024)             | 1079  | +29 / -26      |
| 8    | GPT-4 Turbo Preview              | 1049  | +23 / -21      |
| 9    | Mistral Large 2                  | 1045  | +25 / -28      |
| 10   | Gemini 1.5 Pro (April 2024)      | 1027  | +31 / -31      |
| 11   | Claude 3.5 Sonnet (June 2024)    | 1016  | +26 / -25      |
| 12   | Gemini 1.5 Flash                 | 1005  | +26 / -27      |
| 13   | Gemma 2 27B                      | 981   | +28 / -25      |
| 14   | Llama 3.2 90B Vision Instruct    | 970   | +26 / -26      |
| 15   | Llama 3.3 70B Instruct           | 957   | +36 / -35      |
| 16   | Claude 3 Opus                    | 945   | +22 / -22      |
| 17   | Llama 3.1 405B Instruct          | 940   | +25 / -23      |
| 18   | Llama 3 70B Instruct             | 906   | +27 / -28      |
| 19   | Mistral Large                    | 871   | +28 / -29      |
| 20   | Gemini 1.0 Pro                   | 870   | +29 / -27      |
| 21   | Claude 3 Sonnet                  | 870   | +29 / -33      |
| 22   | Aya 23 35B*                      | 824   | +29 / -31      |
* Note that this is not the newest Aya model; we are actively working on evaluating Aya Expanse.