Nucleus logo

Document AI

Extract data from complex documents in seconds. At human-level accuracy. No templates.

  • blend
  • flexport
  • brex
  • sap
  • doma
  • paypal
  • square
  • openai
  • nvidia
  • general-motors

Extract any field from

packing lists
commercial invoices
bills of lading
mortgage applications
insurance cards
arrival notices

at 99%+ accuracy

OFFERINGS

Explore Options

Self-Serve

$450/mo.

Minimum spend. Starting at $0.08/doc

Ideal for companies processing up to 5K documents per month. Self-serve, models-only document processing.

Document Types

Accounts Payable Invoices:
$0.16 / invoice
Commercial Invoices: $0.25 / invoice
Bills of Lading: $0.20 / bill
Receipts:: $0.08 / receipt

Number of Users

3 users

Supported Languages

English

Taxonomy

Pre-defined taxonomies can be found here. Self-labeled custom fields are available.

Quality

Tooling to review and edit results is provided

Latency

Less than 5 seconds

Human-in-the-loop QA

Models only

Support

In-product Intercom services

Features

  • High accuracy models
  • No templates
  • API integration
  • Line item extraction
  • Validation rules
  • Review tooling
  • Project metrics
  • Customized models
  • Human-in-the-loop QA
  • Intelligent data insights

Start processing your documents with Document AI Go for free today!

Why Scale Document AI

Challenges Our Customers Faced Before Document AI

  • graph wrench icon

    Encountered OCR’s Limits

  • clock icon

    Suffered from Limited Quality

  • technology icon

    Dealt with Chronic Delays

  • technology icon

    Spent Excessively on In-House Engineering

To address these shortcomings, we built Document AI.

How It Works

Operationalize Machine Learning

How It Works

Optional
human-in-the-loop QA is available for complex use cases, and is also used to improve model performance.

industries

Logistics, Finance, and Healthcare Depend On Us

logistics box icon

Bills of Lading, Commercial Invoices, Packing Lists, and more

Reduce delays when clearing customs and delivering goods, minimize operational costs, and get paid on time. Document AI is template-free, fast, and extracts data from your documents at human-level accuracy.

client.createDataExtractionTask({
  callback_url: 'http://www.example.com/callback',
  instruction: 'Extract fields and link relationships.',
  params: {
      attachments: [
        {
          type: 'pdf'
          content: 'bill_of_lading.pdf'
        }
      ],
      labels: ['M&No', 'Description', ...],
      boundingboxes: true,
  }
});

QUALITY ASSURANCE

Accuracy and Fast Turnaround are Guaranteed

Human-Level Accuracy

Our use of Computer Vision and Natural Language Processing models, with fine-tuning, enables much higher quality data extraction than either hard-coded templates or human annotation. We optionally provide human-in-the-loop QA when needed.

ML Means Continual Improvement

Our models are trained on millions of data points, and further refined for each customer use case. Thus, our ML models achieve much higher quality, generalize across challenging document types, and continually improve as we continue to process more data.

Transparency In Metrics

To increase your operational efficiency, you get access to our metrics dashboard to review your pipeline performance, visualization tools to audit your data easily, and our feedback platform to provide instructions.

Scale's dashboard

See It In Action

Get to Know Document AI

CUSTOMERS

Trusted by World Class Companies

“Scale’s machine learning-based Document AI is very different from traditional OCR models, or template-based learning. No templates, high quality, and low latency every time. We rely on Scale for document processing, because with higher extraction accuracy, almost zero human labor is required afterward to correct it. With lower latency, we can enable products like air freight where document data has to arrive much faster since air shipments take less than two days.”

James Chen

Chief Technology Officer, Flexport

“The combination of the Blend platform with Scale’s Document AI ensures the swift, accurate extraction and validation of data from documents, enabling bankers to make data-driven decisions with confidence.”

Jeff Braddock

Manager of Product Partnerships, Blend

“Unlike OCR that basically just extracts information and then leaves to our engineers all the work of understanding the context, Scale Document actually figures out the context for us, and that requires minimal work on our side to actually build and integrate the whole pipeline.”

Henrique Dubugras

Founder and Co-CEO, Brex

“OpenAI threw a bunch of tasks at Scale AI with difficult characteristics, including tight latency requirements and significant ambiguity in correct answers. In response, Scale worked closely with us to adjust their QA systems to our needs.”

Geoffrey Irving

Member of Technical Staff, OpenAI

“Scale has provided the fuel to put our machine learning systems on overdrive. They make sure the highest quality training data is there in time to meet our aggressive roadmap. Lenders and borrowers will experience faster and more efficient closings sooner as a result.”

Andy Mahdavi

Chief Data Science Officer, Doma

With Scale Document AI, document processing is a breeze.