Nucleus logo

Nucleus

Command and control for your ML dataset.

Nucleus dashboard

HOW IT WORKS

The dataset IDE for machine learning.

Optimize your labeling spend by identifying class imbalance, errors, and edge cases in your data with Scale Nucleus.

WHY NUCLEUS

Improving your models starts with improving your data.

Nucleus helps teams build better datasets. Bring your data, labels, and model predictions together to debug your ML models and improve your datasets.

  • Visualize, explore, and slice

    See insights, search on custom metadata, and curate data slices to track model performance on specific scenarios.

  • Debug your models

    Filter data on model performance metrics or explore interactive confusion matrices to quickly find specific examples of model failure, e.g., false positives.

  • Optimize labeling spend

    Curate unlabeled data with active learning, then mine rare edge cases to prioritize the highest-value data to send for labels next.

  • Label flexibly

    Send data to Scale with one click for labeling, label it yourself with Nucleus’s built-in editor, or import and export external labels via API.

  • Collaborate with your team

    Share links to slices, queries, and individual examples. ML engineers, labelers, and data ops specialists can all collaborate on the same platform.

  • Integrate with the Nucleus API

    Automate dataset uploads, add metadata, upload model predictions, and export from Nucleus using its intuitive API.

Data Inputs

Supported Annotation Types.

Image

  • Bounding Box Bounding Boxes
  • Classification Classification
  • Segmentation Segmentation
  • Polygons Polygons
  • Polylines Polylines
  • Cuboids Cuboids
  • Points Points

Video

  • Bounding Boxes Bounding Boxes
  • Classification Classification
  • Segmentation Segmentation
  • Polygons Polygons
  • Polylines Polylines
  • Cuboids Cuboids
  • Points Points

3D and Sensor Fusion

  • Cuboids Cuboids
  • Point Point
"As Nuro works to ensure efficient deliveries as safely as possible, we depend on tools like Nucleus to curate edge cases which we can use to train ever more accurate and capable models."

Jack Guo

Head of Autonomy Platform, Nuro

"The powerful search capabilities and easy-to-use tools made it easy for us to get started with our existing library of annotations."

Oliver Monson

Senior Manager, Data Operations, Velodyne Lidar

“KeepTruckin encounters all manners of surprising edge cases in real world data collection, so when it comes to knowing we’re labeling the most valuable subset of our collected data, we turn to Scale Nucleus. Its intuitive visualizations, query engine and Autotag help our teams improve both data quality and models, all in the same motion.”

Ali Rehan

Engineering Manager AI/Vision Products, KeepTruckin

“Once our data was uploaded, we were able to train, validate and deploy 3 classifiers to our full dataset to identify police cars, ambulances, and firetrucks in just 4 hours, identifying a large number of these rarer case images to send to labeling. This process would have taken weeks without Nucleus.”

Varun Sundar Rabindranath

ML and Perception Engineer, Magna International

Pricing

Explore Pricing

Save when you subscribe

For a monthly subscription, enjoy unlimited data volume and seats, 50% reductions in usage rates, and access to enterprise features.

This is your best option if:

  • You want to get started right away without committing upfront

  • You have a smaller dataset
(<200,000 items)

Subscription / Month

Free

Seats

20

Price / 1,000 Items

Data Volume

$9.00

Querying

$0.002

Embedding Indexing

$2.00

Autotag Creation

$0.02

Model Error Computation

$0.02

Scale Model Zoo Inference

$3.00

Save when you subscribe

For a monthly subscription, enjoy unlimited data volume and seats, 50% reductions in usage rates, and access to enterprise features.

This is your best option if:

  • You plan to use Nucleus frequently.

  • You would benefit from unlimited data volume and seats.

  • You want to host data yourself (Privacy Mode*) instead of uploading to Nucleus.

Subscription / Month

$7,500

Seats

Unlimited

Price / 1,000 Items

Data Volume

Free

Querying

$0.001

Embedding Indexing

$1.00

Autotag Creation

$0.01

Model Error Computation

$0.01

Scale Model Zoo Inference

$1.50

All pricing is for data that does not contain Restricted Information, as defined by Scale AI’s Master Software and Services Agreement. For projects that contain Restricted Information, or for public sector or government projects that require specialized data handling or security, please contact us.

CUSTOMERS

Trusted by World Class Companies

Scale is trusted by leading machine learning teams to develop more accurate models.

  • airbnb
  • brex
  • etsy
  • general-motors
  • flexport
  • instacart
  • blend
  • nvidia
  • openai
  • paypal
  • sap
  • magna
  • square
  • toyota
  • airforce
  • usarmy
  • aurora
  • irobot
  • lyft
  • linkedin
  • nuro
  • pinterest
  • skydio
  • zoox

FAQ