scale logo
<- Back to leaderboard

EnigmaEval

Multimodal Reasoning

Rank (UB): 1 + the number of models whose lower CI bound exceeds this model’s upper CI bound.