Products
Scale RapidThe fastest way to production-quality labels.
Scale StudioLabeling infrastructure for your workforce.
Scale 3D Sensor FusionAdvanced annotations for LiDAR + RADAR data.
Scale ImageComprehensive annotations for images.
Scale VideoScalable annotations for video data.
Scale TextSophisticated annotations for text-based data.
Scale AudioAudio Annotation and Speech Annotation for NLP.
Scale MappingThe flexible solution to develop your own maps.
Scale CatalogCreate, enrich, and enhance eCommerce data.
Scale Enterprise AIModels to support your business use cases.
Scale NucleusThe mission control for your data
Scale LaunchShip and track your models in production
Scale Content UnderstandingManage content for better user experiences
Scale InstantMLNext-day machine learning models, without ML expertise
Scale SpellbookThe platform for large language model apps
Scale SyntheticGenerate synthetic data
Solutions
Retail & eCommerce
Defense
Logistics
Autonomous Vehicles
Robotics
AR/VR
Content & Language
RLHF
Smart Port Lab
Federal LLMs
Resources
Resource Library
Blog
Events
Open Datasets
Interviews
Documentation
Guides
Customers
Pricing
Conference
AI Readiness Report 2022
Company
Scale Rapid Open Source Licenses
- Scale Main Services Agreement
- Scale Acceptable Use Policy
- Scale End User Terms of Use
- Scale Website Terms and Conditions
- Scale Privacy Policy
- Scale Subprocessors
- Scale Cookie Policy
- Scale Event Terms & Conditions and Guidelines
- Scale Hackathon Terms and Conditions
- Scale Product Terms
- Scale Rapid Open Source Licenses
- Scale Nucleus Open Source Licenses
- CADC Terms of Use
- Oxford Dataset Terms of Use
- PandaSet Terms of Use
- Ukraine Dataset Terms of Use
- [Legacy] Scale Master Software and Services Agreement
- [Legacy] Scale Privacy Policy
Attributions for open source datasets made available in Scale Rapid are available below. This document contains licensing information relating to the use of free and open-source software (FOSS) with or within the Scale Rapid software. Any terms, conditions, or restrictions on FOSS included within the Scale Rapid software that are not included within the original FOSS licenses are offered and imposed by Scale alone. The authors, licensors, and distributors of the FOSS disclaim all express or implied conditions, representations, and warranties relating to the FOSS and any liability arising from use and distribution of the FOSS. This document identifies the FOSS packages made available in the Scale Rapid software, the FOSS licenses that Scale believes govern those FOSS packages, and copyright and license notices associated with Scale’s use of the FOSS. While Scale has sought to provide complete and accurate licensing information for each FOSS package, Scale does not represent or warrant that the licensing information provided herein is correct or error-free. Recipients of the product should investigate the identified FOSS packages to confirm the accuracy of the licensing information provided herein. Recipients are also encouraged to notify Scale of any inaccurate information or errors found in these notices. Certain FOSS licenses, such as the Mozilla Public License, require Scale to make available to recipients the source code corresponding to FOSS binaries distributed under those licenses. Recipients who would like to receive a copy of such source code should submit a request to Scale by post at: Scale AI, Inc. Attn: FOSS Requests 303 2nd St, Fl 5, San Francisco, CA 94107. Please identify in submitted FOSS requests: the FOSS packages for which you are requesting source code; the Scale product and version number with which the requested FOSS package was distributed; an email address at which Scale may contact you regarding the request (if available); and the postal address for delivery of the requested source code.
MNIST (http://yann.lecun.com/exdb/mnist/)
COCO 2020 (https://cocodataset.org/#home)
Copyright COCO Consortium
The annotations in this dataset are licensed under a Creative Commons Attribution 4.0 License. The COCO Consortium does not own the copyright of the images. Use of the images must abide by the Flickr Terms of Use. The users of the images accept full responsibility for the use of the dataset, including but not limited to use of any copies of copyrighted images that they may create from the dataset.
CIFAR-100 (https://www.cs.toronto.edu/~kriz/cifar.html)
Debagreement: Reddit 50K (https://scale.com/open-av-datasets/oxford)
This dataset is distributed by John Pougué-Biyong, Valentina Semenova, Alexandre Matton, Rachel Han, Aerin Kim, Renaud Lambiotte, and Doyne Farmer under a Creative Commons Attribution 4.0 International Public License (“CC BY 4.0”).
Wikipedia Links Data (https://code.google.com/archive/p/wiki-links/downloads)
This dataset is distributed by Sameer Singh, Amarnag Subramanya, Fernando Pereira, and Andrew McCallum under a Creative Commons BY license
Speech Commands Dataset (https://ai.googleblog.com/2017/08/launching-speech-commands-dataset.html)
This dataset is released under a Creative Commons BY 4.0 license.
FSDnoisy18k (https://zenodo.org/record/2529934#.Yz4TbezML6v)
This dataset is licensed under a Creative Commons BY 4.0 license. Iindividual audio clips are licensed under a Creative Commons BY 4.0 or CC0 1.0 Universal (CC0 1.0) license.
More information about the licensing can be found at https://zenodo.org/record/2529934#.Yz4TbezML6v