We’re going to need a lot more investment in high-quality evals and benchmarks to help us understand the actual comparative utility of the various models. This new set of private evals and leaderboards from Scale are great to see
Nat Friedman
Entrepreneur and Investor
Trusted by the world's most ambitious AI teams







