<- Back to leaderboard
Visual-Language Understanding
Model
Score
Std. Deviation
1st
Gemini-2.0-Flash-Exp
39.70
+0.78 / -0.78
2nd
Claude 3.5 Sonnet (October 2024)
38.41
+0.65 / -0.65
3rd
Claude 3.5 Sonnet (June 2024)
37.88
+0.61 / -0.61
4
ChatGPT-4o-latest (November 2024)
37.83
+0.38 / -0.38
5
Gemini 1.5 Pro
36.70
+1.22 / -1.22
6
GPT-4o (August 2024)
34.78
+0.22 / -0.22
7
Pixtral Large (November 2024)
33.83
+0.65 / -0.65
8
Gemini-1.5-Flash-002
33.63
+1.33 / -1.33
9
Claude 3 Opus
27.58
+0.50 / -0.50
10
Pixtral 12B (September 2024)
25.75
+0.57 / -0.57
11
Llama 3.2 90B Vision Instruct
24.40
+0.75 / -0.75
12
Llama 3.2 11B Vision-Instruct
20.20
+0.11 / -0.11
13
Phi 3.5 Vision-Instruct
15.11
+0.71 / -0.71