Self-Hosted Data

📘

This feature is available to Enterprise-tier customers only.

If you're interested in using Studio's Self-Hosted Data capabilities, please contact the team at scale_studio@scale.com.

📘

This feature is only available for certain data types.

Currently, Studio's Self-Hosted Data capabilities are available for image annotation, document transcription, and certain categorization task types. If you are interested in support for other task types, please contact the team at scale_studio@scale.com.

Typically, when customers upload attachments to Scale Studio, the platform processes each asset for optimization and quality purposes (for example, videos are processed for faster rendering). With Self-Hosted Data, customers reference their assets through private URLs instead, so they can use Scale Studio without their data passing through Scale’s servers.

Private URLs

Private URLs are URLs that only certain people can access. For example, you can configure permissions so that only employees connected to your VPN are able to download and view content. One way to do this is through the access policy of a cloud storage bucket, such as an Amazon S3 bucket.

Example S3 bucket policy:

{
    "Version": "2012-10-17",
    "Id": "S3PolicyId1",
    "Statement": [
        {
            "Sid": "IPAllow",
            "Effect": "Allow",
            "Principal": "*",
            "Action": "s3:*",
            "Resource": [
                "arn:aws:s3:::BUCKET_NAME",
                "arn:aws:s3:::BUCKET_NAME/*"
            ],
            "Condition": {
                "IpAddress": {
                    "aws:SourceIp": "YOUR_VPN_IP_ADDRESS"
                }
            }
        }
    ]
}
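
If you manage several buckets, it can help to generate the policy rather than edit it by hand. The following is a minimal sketch that reproduces the example policy above for a given bucket and VPN address. The function name `build_ip_allow_policy`, the bucket name, and the IP range are placeholders chosen for illustration, not part of Studio or AWS.

```python
import json


def build_ip_allow_policy(bucket_name: str, source_ip: str) -> dict:
    """Return a bucket policy shaped like the example above.

    bucket_name and source_ip are caller-supplied placeholders;
    source_ip may be a single address or a CIDR range.
    """
    return {
        "Version": "2012-10-17",
        "Id": "S3PolicyId1",
        "Statement": [
            {
                "Sid": "IPAllow",
                "Effect": "Allow",
                "Principal": "*",
                "Action": "s3:*",
                "Resource": [
                    f"arn:aws:s3:::{bucket_name}",
                    f"arn:aws:s3:::{bucket_name}/*",
                ],
                # Only requests arriving from this address or range match.
                "Condition": {"IpAddress": {"aws:SourceIp": source_ip}},
            }
        ],
    }


if __name__ == "__main__":
    # Hypothetical values; replace with your own bucket and VPN egress IP.
    policy = build_ip_allow_policy("my-studio-assets", "203.0.113.10/32")
    print(json.dumps(policy, indent=4))
```

The printed JSON can be pasted into the bucket's policy editor. If viewers only need to read objects, a narrower action such as `s3:GetObject` may be sufficient in place of `s3:*`.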

Confirm that the resulting URLs open only while you are connected to your VPN. The URLs will look like https://{BUCKET_NAME}.s3.{REGION}.amazonaws.com/{FILENAME}, where {REGION} is the full region code (for example, us-east-1).

After generating the list of URLs, upload them in a CSV file, using attachment_url as the column name.
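
As a rough illustration of that step, the sketch below builds URLs for a list of object keys and produces CSV text with a single attachment_url column. It assumes the virtual-hosted-style S3 URL form `https://{bucket}.s3.{region}.amazonaws.com/{key}` with a full region code such as us-east-1. The helper names, bucket, region, and keys are hypothetical.

```python
import csv
import io
from urllib.parse import quote


def private_url(bucket: str, region: str, key: str) -> str:
    """Build a virtual-hosted-style S3 URL for one object key."""
    # quote() percent-encodes spaces and similar characters but keeps "/".
    return f"https://{bucket}.s3.{region}.amazonaws.com/{quote(key)}"


def attachment_csv(urls: list[str]) -> str:
    """Return CSV text with an attachment_url header and one URL per row."""
    buffer = io.StringIO()
    writer = csv.writer(buffer)
    writer.writerow(["attachment_url"])
    for url in urls:
        writer.writerow([url])
    return buffer.getvalue()


if __name__ == "__main__":
    # Hypothetical bucket, region, and object keys.
    keys = ["images/cat 1.png", "images/dog.png"]
    urls = [private_url("my-studio-assets", "us-east-1", k) for k in keys]
    with open("attachments.csv", "w", newline="") as handle:
        handle.write(attachment_csv(urls))
```

The resulting attachments.csv is the file you would upload. Any additional columns your project expects are outside the scope of this sketch.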

If images do not load on your private network, or you see CORS errors in your browser's network tab, you may need to configure CORS on the bucket.

Example CORS configuration:

[
    {
        "AllowedHeaders": [
            "*"
        ],
        "AllowedMethods": [
            "GET",
            "HEAD"
        ],
        "AllowedOrigins": [
            "*"
        ],
        "ExposeHeaders": []
    }
]
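
To check whether a CORS configuration like the one above has taken effect, one option is to request an object with an Origin header and inspect the response. The sketch below is an assumption-laden illustration: the function names are invented here, the URL and origin are placeholders, and it must be run from a machine that is allowed to reach the bucket (for example, one on your VPN).

```python
import urllib.request


def allows_origin(response_headers, origin: str) -> bool:
    """Return True if Access-Control-Allow-Origin permits the given origin."""
    allowed = response_headers.get("Access-Control-Allow-Origin")
    return allowed == "*" or allowed == origin


def check_cors(url: str, origin: str) -> bool:
    """Send a HEAD request with an Origin header and inspect the reply.

    Raises urllib.error.HTTPError if the bucket rejects the request,
    for example with 403 when called from outside the allowed IP range.
    """
    request = urllib.request.Request(url, method="HEAD", headers={"Origin": origin})
    with urllib.request.urlopen(request, timeout=10) as response:
        return allows_origin(response.headers, origin)


if __name__ == "__main__":
    # Offline demonstration with a hand-written header mapping.
    print(allows_origin({"Access-Control-Allow-Origin": "*"}, "https://example.com"))
    # Against a real object (placeholder URL and origin), you would call:
    # check_cors("https://my-studio-assets.s3.us-east-1.amazonaws.com/images/dog.png",
    #            "https://example.com")
```

A False result from `check_cors` suggests the browser would block the response, so the CORS configuration is worth revisiting.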

To learn more about this feature or to start using it, please reach out to scale_studio@scale.com.
