Computer Use Environments
Train agents to navigate and operate realistic desktop and web environments, including macOS- and Windows-like systems.

Train and evaluate agents to excel at long-horizon, professional workflows.
Simulated APIs, MCP servers, and GUIs that behave like the systems agents actually use.
Files (PDFs, CSVs, PPTs, etc.), data, and state settings that reflect real professional workflows, sourced from real business settings, capturing natural messiness.
Expert-designed objectives, rubrics, and automated verifiers that produce stable training signals for programmatic process and outcome evaluation.
Compatible with standard environment interfaces, supporting trajectories, resetting state, examining rewards, and more.

Scale AI's RL Environments are simulated collections of realistic applications designed to train and evaluate agent behavior.

They mirror real computer and API-based systems, supporting rich logs and application state that can be used for programmatic and rubric-based evaluation.

Each environment combines simulated interfaces with expert-curated data and evaluators to produce reliable learning signals. Scale also provides analysis on complexity (pass@k) and anticipated training gains.
Train agents to navigate and operate realistic desktop and web environments, including macOS- and Windows-like systems.


Train agents to reason over and use real tools via MCP servers — Slack, HubSpot, Linear, and more.
Available as
Three types of simulated environments — each designed to match the systems agents operate in production.
Browser-based SaaS and productivity apps.
macOS- and Windows-like operating system environments.
Real enterprise tools — Slack, HubSpot, Linear, and more.
