Multimodal Reasoning
6.30
+1.02 / -1.02
5.65
+0.27 / -0.27
4.23
+0.23 / -0.23
4.07
+0.40 / -0.40
3.18
+0.14 / -0.14
2.26
+0.32 / -0.32
1.10
+0.09 / -0.09
0.91
+0.08 / -0.08
0.84
+0.10 / -0.10
0.69
+0.21 / -0.21
0.82
+0.02 / -0.02
0.80
+0.06 / -0.06
0.63
+0.12 / -0.12
0.38
+0.03 / -0.03
Rank (UB): 1 + the number of models whose lower CI bound exceeds this model’s upper CI bound.