
Humanity's Last Exam (Text Only)

Models evaluated on text-only HLE questions

Rank (UB): 1 + the number of models whose lower confidence interval (CI) bound exceeds this model’s upper CI bound.
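
The Rank (UB) rule above can be sketched in a few lines of Python. This is a minimal illustration, not the leaderboard's actual implementation: the function name `rank_upper_bound`, the model names, and the interval values are all hypothetical, and the sketch assumes each model comes with a (lower, upper) pair of CI bounds on its score.

```python
from typing import Dict, Tuple


def rank_upper_bound(intervals: Dict[str, Tuple[float, float]]) -> Dict[str, int]:
    """Return 1 + the number of models whose lower CI bound exceeds
    each model's upper CI bound."""
    ranks = {}
    for name, (_, upper) in intervals.items():
        # Count the other models that are clearly better: their whole
        # interval sits above this model's interval.
        clearly_better = sum(
            1
            for other, (lower, _) in intervals.items()
            if other != name and lower > upper
        )
        ranks[name] = 1 + clearly_better
    return ranks


# Hypothetical (lower, upper) CI bounds; not actual leaderboard data.
example = {
    "model_a": (20.0, 24.0),
    "model_b": (18.0, 22.0),
    "model_c": (10.0, 14.0),
}
print(rank_upper_bound(example))
# → {'model_a': 1, 'model_b': 1, 'model_c': 3}
```

Under this rule, models with overlapping intervals share a rank: `model_a` and `model_b` are both ranked 1 because neither interval lies entirely above the other, while `model_c` is ranked 3 because both other intervals lie entirely above its own.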
