February 4, 2026

Scale AI partners with Webster University to launch a technical writing certificate advancing AI workforce skills.
February 2, 2026

Scott O’Neill is a plumbing sales professional in Louisiana who contributes to building better AI models in his spare time. Between a full-time job and raising two young daughters, Scott uses flexible, remote work through Outlier to stay connected to technology, apply his problem-solving skills, and continue learning. His story reflects how people from diverse backgrounds are helping shape the future of AI on schedules that fit real life.
December 19, 2025

Agentic AI marks a shift from reactive chatbots to autonomous mission partners. Government must adopt unified Agentic Infrastructure—combining resilient agent execution and governed AgentOps—to enable machine-speed decisions. Platforms like Scale’s SGP and Agentex deliver interoperable, durable, and accountable autonomy for mission assurance.
April 25, 2025

Training AI models to behave responsibly in the real world means preparing them for the full range of online content — including the challenging parts. It’s not easy work, but it’s necessary. At Scale, we believe that building AI systems that avoid harmful, abusive, or dangerous behavior is one of the most important challenges of our time. And we’re proud to support the people who make this possible.
April 23, 2025

As part of Scale’s ongoing investment in its AI workforce in St. Louis, Scale and the University of Missouri-St. Louis (UMSL) are officially launching a collaborative education effort.
April 3, 2025

Since its inception in 2023, Outlier has become a cornerstone of the AI industry, connecting hundreds of thousands of people across the globe with meaningful and flexible work. Hailing from cities and small towns across the world, Outlier contributors have collectively earned hundreds of millions of dollars by helping build the foundation of today’s most advanced AI models.
April 2, 2025

Frontier AI development has reached an inflection point: as models rapidly advance in capabilities, the need for sophisticated evaluation has become a decisive factor in competitive success. That’s why today we're announcing updates to Scale Evaluation, our platform that helps teams identify model weaknesses and validate improvements. Our updated platform introduces four key capabilities: instant model comparison across thousands of tests, multi-dimensional performance visualization, automated error discovery, and targeted improvement guidance—all designed to help teams identify weaknesses faster and make more confident release decisions. These updates build on Scale Evaluation’s foundation introduced last year, broadening access to frontier evaluation capabilities.
March 26, 2025

Scale AI products have been approved for purchase on AWS Marketplace for the U.S. Intelligence Community (ICMP). ICMP is a digital catalog that makes it easy for customers in the U.S. national security community to find, test, buy, and deploy software that runs on AWS.
March 5, 2025

Scale is proud to have been awarded a prime contract by the Defense Innovation Unit (DIU) for Thunderforge, the DoD’s flagship program leveraging AI for military planning and wargaming. Thunderforge represents our commitment to advancing U.S. military capabilities. Following its initial deployment, Thunderforge will expand throughout combatant commands, leveraging Scale AI's agentic applications and GenAI evaluation expertise.
February 27, 2025

Scale AI, in collaboration with the Center for Strategic and International Studies (CSIS), is proud to introduce the Critical Foreign Policy Decision (CFPD) Benchmark—a pioneering effort to evaluate large language models (LLMs) on national security and foreign policy decision-making tendencies.
February 23, 2025

The Ministry of Communications and Information Technology (MCIT) and Scale AI, the leader in frontier AI solutions, are announcing a strategic, long-term partnership to drive Qatar’s digital transformation.
February 11, 2025

Scale researchers have discovered a groundbreaking method for AI safety testing called J2 (Jailbreaking to Jailbreak), where language models are taught to systematically test their own and other models' safety measures. This hybrid approach combines human-like strategic reasoning with automated scalability, achieving success rates of over 90% in vulnerability testing, nearly matching professional human red-teaming effectiveness. While highlighting significant advances in automated security testing, these findings also reveal important challenges for the future of AI safety.
February 11, 2025

Scale AI leads groundbreaking research to build safer, more capable AI systems through innovative approaches in post-training optimization, agent development, and evaluation frameworks. Its comprehensive work spans from improving model performance and reliability to developing robust safety measures, all while maintaining a commitment to open collaboration and industry-wide advancement. Through the Safety, Evaluations, and Alignment Lab (SEAL) and various research initiatives, Scale AI is shaping the future of responsible AI development.
February 10, 2025

Scale’s AISI-approved AI model evaluations are setting a new standard for pre-deployment testing. By offering voluntary, efficient, and third-party validated assessments, we are empowering AI developers to create more reliable models—without the complexities that typically slow down the process.
January 23, 2025

Scale AI and the Center for AI Safety (CAIS) are proud to publish the results of Humanity’s Last Exam, a groundbreaking new AI benchmark that was designed to test the limits of AI knowledge at the frontiers of human expertise.
January 3, 2025

As we return to work after the holiday break, the Scale AI Public Sector team wanted to reflect on our work heading into 2025. As strategic rivalries continue to intensify and adversaries form new alliances globally to challenge U.S. leadership in AI, the mission of Scale’s Public Sector team has never been more vital. We are dedicated to ensuring that the U.S. and its allies have the best technology to lead in this increasingly complex global landscape. The snapshot below captures a few key highlights from last year:
November 19, 2024

Microsoft Azure and Scale AI collaborate to help enterprises deliver powerful agentic GenAI solutions with customized and fine-tuned Azure AI models.
November 4, 2024

Scale AI is proud to announce Defense Llama, the Large Language Model (LLM) built on Meta’s Llama 3 that is specifically customized and fine-tuned to support American national security missions.
September 16, 2024

Scale AI and CAIS are excited to announce the launch of Humanity's Last Exam, a project aimed at measuring how close we are to achieving expert-level AI systems.
July 23, 2024

Scale is proud to be a Llama 3.1 Launch Partner! Llama 3.1 405B is the largest openly available foundation model, rivaling the best closed-source models. Meta and Scale partnered to help businesses customize, evaluate, and deploy Llama 3.1 405B for enterprise use cases using Scale GenAI Platform.
July 10, 2024

Amazon Web Services (AWS) names Scale AI as the first model customization and evaluation partner on Amazon Bedrock.
June 6, 2024

With the rapid advancement of AI model capabilities, it is necessary, now more than ever, to test and evaluate AI systems to ensure that they are safe to deploy for their intended use cases. Scale AI is committed to promoting AI safety through our T&E offering, Scale Evaluation.
May 31, 2024

Today marks the one-year anniversary of Donovan, our pioneering mission-focused AI application designed to responsibly support the public sector. We look back at the AI capabilities that have been deployed and what Scale AI Public Sector has planned for the next year.
May 29, 2024

As a third-party model evaluator trusted by leading AI labs, Scale is excited to release the SEAL Leaderboards, which rank frontier LLMs using curated private datasets that can’t be gamed.
April 2, 2024

When Scale introduced Donovan, a large language model (LLM) platform built for the public sector, one of the top questions we were asked was, “Why Donovan?” Learn how Scale Donovan serves the public sector in the footsteps of its namesake.
February 20, 2024

Scale AI, the leading test and evaluation (T&E) partner for frontier artificial intelligence companies, is proud to share that we are partnering with the U.S. Department of Defense’s (DoD) Chief Digital and Artificial Intelligence Office (CDAO) to create a comprehensive T&E framework for the resp...
February 13, 2024

2023 ushered in a wave of excitement about Large Language Models (LLMs) and became the year of the Generative AI proof-of-concept. Enterprises experimented with Generative AI and explored how it may impact their business. According to BCG,
December 21, 2023

Scale AI and Austin Community College District (ACC) recently teamed up to host a hackathon that enabled participants to craft prototypes with practical applications using Donovan, Scale’s AI-powered digital staff assistant. The hackathon, held on December 12 at the ACC Rio Grande Campus ACCelerator
December 5, 2023

Autonomous vehicle development requires iterative improvements in perception models through a data engine. These data engines currently rely on a set of task-specific models based around a fixed taxonomy of objects and scenarios to identify. However, there are two critical limitations to existing...
November 14, 2023

Scale AI partners with CSIS to leverage LLMs for global challenges and national security. The collaboration focuses on strategic wargaming, decision support, and international relations. Together, they aim to advance insights and solutions for complex issues.
November 8, 2023

As the leading test and evaluation partner for frontier AI companies, Scale plays an integral role in understanding and safeguarding large language models (LLMs).
October 4, 2023

September 12, 2023

Today, Scale will sign onto the Biden-Harris Administration’s voluntary commitments to ensure that AI is safe, secure, and trustworthy. These commitments are critical to the future of AI.
August 11, 2023

Enabling effective and secure generative AI will require a comprehensive T&E offering, encompassing model evaluation, monitoring, and red teaming, as demonstrated in our approach to Test and Evaluation.
August 6, 2023

August 2, 2023

May 19, 2023

Scale AI has been selected by the U.S. Army and the Defense Innovation Unit (DIU) to deploy its Data Engine in support of the Army’s Robotic Combat Vehicle (RCV) program.
April 26, 2023

March 30, 2023

January 10, 2023

November 9, 2022

October 19, 2022

September 6, 2022

June 28, 2022

March 7, 2022

We have received an overwhelming response to our announcement of free, AI-ready datasets in support of Ukraine. Nearly a hundred AI companies, researchers, developers, and GIS practitioners have offered to support the initiative with Scale – an inspiring global community of AI Developers for good.
February 2, 2022

Introducing Scale Synthetic: Enhance ML performance with realistic synthetic data, overcoming the limitations of real-world data like privacy, bias, and scarcity.
December 21, 2021

November 30, 2021

November 8, 2021

AI is no longer the exclusive domain of large enterprises. It’s entering the business mainstream—but it’s successful only sometimes, says Salesforce President and Chief Operating Officer Bret Taylor.
November 3, 2021

At TransformX, we brought together a community of leaders, visionaries, practitioners, and researchers across industries to explore the shift from research to reality within Artificial Intelligence (AI) and Machine Learning (ML).
November 3, 2021

November 2, 2021

October 27, 2021

October 21, 2021

October 20, 2021

At TransformX, we brought together a community of leaders, visionaries, practitioners, and researchers across industries to explore the shift from research to reality within Artificial Intelligence (AI) and Machine Learning (ML).
October 19, 2021

October 19, 2021

September 13, 2021

June 23, 2021

June 21, 2021

Navigation errors are minor for individuals but costly for autonomous vehicles and logistics, impacting efficiency at scale.
April 7, 2021

September 16, 2020

August 5, 2020

May 20, 2020

April 20, 2020

The Scale AI team is excited to introduce a new look and new structure for our product line-up.
March 17, 2020

June 5, 2019
