![](https://model-eval-leaderboard-git-main-scaleai.vercel.app/_next/image?url=%2Fassets%2Fseal-logo-gradient.png&w=256&q=75)
Leaderboards
Expert-Driven Private Evaluations
![](https://model-eval-leaderboard-git-main-scaleai.vercel.app/_next/image?url=%2Fassets%2Fdatasets.png&w=256&q=75)
Private Datasets
Scale’s proprietary, private evaluation datasets can’t be gamed, ensuring unbiased and uncontaminated results.
![](https://model-eval-leaderboard-git-main-scaleai.vercel.app/_next/image?url=%2Fassets%2Fcompetition.png&w=256&q=75)
Evolving Competition
We periodically update leaderboards with new datasets and models, fostering a dynamic, contest-like environment.
![](https://model-eval-leaderboard-git-main-scaleai.vercel.app/_next/image?url=%2Fassets%2Fevaluations.png&w=256&q=75)
Expert Evaluations
Our evaluations are performed by thoroughly vetted experts using domain specific methodologies, ensuring the highest quality and credibility.
Learn more about our evaluation methodology here →
Coding→
Learn More
Math→
Learn More
Instruction Following→
Learn More
If you’d like to add your model to this leaderboard or a future version, please contact seal@scale.com. To ensure leaderboard integrity, we require that models can only be featured the FIRST TIME when an organization encounters the prompts.