This document lists the system limits that apply to Document AI. Unlike quotas, system limits can't be changed.
Content limits
The following content limits apply to all Document AI processors.
Content limit | Value |
---|---|
Maximum image resolution (limit does not apply to PDF files) | 40 megapixels (per page if image contains multiple pages) |
Maximum file size for online processing requests | 40 MB |
Maximum file size for batch processing requests | 1 GB (40 MB for PDF file type in layout parser) |
Files per batch processing request | 5,000 files |
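As an illustration, the content limits above can be checked on the client before a request is sent. The following is a minimal sketch, not part of any Document AI client library: the function names are hypothetical, and it assumes decimal units (1 MB = 1,000,000 bytes), which is the stricter reading because the table does not say whether MB is decimal or binary.

```python
# Assumption: decimal units. The table does not specify decimal vs. binary.
MB = 1_000_000
GB = 1_000 * MB

# Values copied from the content limits table.
MAX_ONLINE_BYTES = 40 * MB
MAX_BATCH_BYTES = 1 * GB
MAX_BATCH_LAYOUT_PARSER_PDF_BYTES = 40 * MB
MAX_FILES_PER_BATCH = 5_000
MAX_IMAGE_PIXELS = 40_000_000  # 40 megapixels per page; not applied to PDFs


def online_file_ok(size_bytes):
    """True if a file is small enough for an online processing request."""
    return size_bytes <= MAX_ONLINE_BYTES


def image_page_ok(width_px, height_px):
    """True if one image page is within the resolution limit."""
    return width_px * height_px <= MAX_IMAGE_PIXELS


def batch_problems(files, layout_parser=False):
    """Check a batch request.

    files: list of (name, size_bytes) tuples.
    layout_parser: set to True when the target is the layout parser,
    which has a lower size limit for PDF files.
    Returns a list of problem descriptions (empty if none found).
    """
    problems = []
    if len(files) > MAX_FILES_PER_BATCH:
        problems.append(
            f"{len(files)} files exceeds {MAX_FILES_PER_BATCH} per batch request"
        )
    for name, size in files:
        limit = MAX_BATCH_BYTES
        if layout_parser and name.lower().endswith(".pdf"):
            limit = MAX_BATCH_LAYOUT_PARSER_PDF_BYTES
        if size > limit:
            problems.append(f"{name}: {size} bytes exceeds {limit} bytes")
    return problems
```

A check like this only catches oversized input early; the service remains the authority on what it accepts.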
Processor limits
The limits for each processor type are listed in the following sections.
Extraction processors
Processor
Limits
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 200 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 100 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 500 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Classification processors
Processor
Limits
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 200 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 1000 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Digitize processors
Processor
Limits
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 500 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Pretrained processors
Processor
Limits
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 30 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 15 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 15 |
Maximum pages (online/synchronous requests): | 2 |
Maximum pages (batch/offline/asynchronous requests): | 2 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 2 |
Maximum pages (online/synchronous requests): | 10 |
Maximum pages (batch/offline/asynchronous requests): | 200 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 2 |
Maximum pages (batch/offline/asynchronous requests): | 2 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 2 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 50 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Maximum pages (online/synchronous requests): | 2 |
Maximum pages (batch/offline/asynchronous requests): | 2 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 2 |
Maximum pages (online/synchronous requests): | 10 |
Maximum pages (batch/offline/asynchronous requests): | 10 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 10 |
Maximum pages (online/synchronous requests): | 15 |
Maximum pages (batch/offline/asynchronous requests): | 200 |
---|---|
Maximum pages (imageless mode online/synchronous requests): | 30 |
Limitations for Document AI
Document AI has the following limitations.
Criteria
Stable release July 2023
Dataset
- Maximum of 30,000 documents total
- Maximum of 250,000 pages total
Document import
- Maximum of 5,000 documents per import
- Maximum of 200 pages per document
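The dataset and import limits above can be combined into a single check before an import is started. This is a rough sketch under stated assumptions: `import_problems` is a hypothetical helper, and the caller is assumed to already know the current dataset totals and the page count of each document to import.

```python
# Values copied from the Dataset and Document import limits.
MAX_DATASET_DOCS = 30_000
MAX_DATASET_PAGES = 250_000
MAX_DOCS_PER_IMPORT = 5_000
MAX_PAGES_PER_DOC = 200


def import_problems(existing_docs, existing_pages, page_counts):
    """Check a planned import against the dataset and import limits.

    existing_docs, existing_pages: current dataset totals.
    page_counts: one page count per document in the planned import.
    Returns a list of problem descriptions (empty if none found).
    """
    problems = []
    if len(page_counts) > MAX_DOCS_PER_IMPORT:
        problems.append(
            f"{len(page_counts)} documents exceeds {MAX_DOCS_PER_IMPORT} per import"
        )
    too_long = [n for n in page_counts if n > MAX_PAGES_PER_DOC]
    if too_long:
        problems.append(
            f"{len(too_long)} document(s) exceed {MAX_PAGES_PER_DOC} pages"
        )
    if existing_docs + len(page_counts) > MAX_DATASET_DOCS:
        problems.append(f"dataset would exceed {MAX_DATASET_DOCS} documents")
    if existing_pages + sum(page_counts) > MAX_DATASET_PAGES:
        problems.append(f"dataset would exceed {MAX_DATASET_PAGES} pages")
    return problems
```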
Limits to train a Custom Document Extractor (CDE)
Model-based training (GA)
- Training dataset maximums: 25,000 documents; 100,000 pages
- Training dataset minimum: each label needs to be present on at least 1 label per 10 documents
- Test dataset maximums: 2,000 documents; 8,000 pages
- Test dataset minimum: every label on at least 10 documents
- Maximum of 200 pages per document
Template-based training (GA)
- Training dataset maximums: 300 documents; 300 pages
- Training dataset minimum: every label on at least 3 documents
- Test dataset maximums: 2,000 documents; 8,000 pages
- Test dataset minimum: every label on at least 3 documents
- Maximum of 20 pages per document
Limits to train a Custom Document Classifier (CDC) or a Custom Document Splitter (CDS)
- Training dataset maximums: 30,000 documents; 100,000 pages
- Training dataset minimum: every label on at least 10 documents
- Test dataset maximums: 2,000 documents; 8,000 pages
- Test dataset minimum: every label on at least 2 documents
- Maximum of 200 pages per document
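The "every label on at least N documents" minimums can be verified by counting, for each label, how many documents contain it. The sketch below assumes each document is represented as a collection of label names, which is a simplification chosen for illustration; `labels_below_minimum` is a hypothetical helper.

```python
from collections import Counter


def labels_below_minimum(documents, required_labels, min_docs):
    """Return the required labels that appear on fewer than min_docs documents.

    documents: iterable of label-name collections, one per document.
    A label is counted once per document, however often it occurs there.
    """
    counts = Counter()
    for labels in documents:
        counts.update(set(labels))
    return sorted(label for label in required_labels if counts[label] < min_docs)
```

For a classifier or splitter, for example, the training split would be checked with `min_docs=10` and the test split with `min_docs=2`, matching the minimums listed above.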
Labeling
- To get started, verify that document labels meet the defined minimum training and evaluation thresholds.
- To begin evaluating model performance for documents with layout variation, label at least 100 documents. Specifically, verify that each label exists on 50 documents in training and 50 in evaluation.
- Maximum allowed labels (fields): 150
- Label size limits (characters): Long items aren't well supported, but there's no explicit limit. Chunk documents into 800- or 1,000-token pieces, with 100 to 200 tokens overlapping between chunks. (Items longer than the overlapping area might run into quality issues.)
- Label occurrences in a document: No limit
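The chunking guidance in the label size item (800- or 1,000-token pieces with 100 to 200 tokens of overlap) can be sketched as a sliding window. This is only an illustration: it assumes the text has already been split into a list of tokens, and how tokens are produced is left to the caller because the source does not name a tokenizer.

```python
def chunk_tokens(tokens, chunk_size=1000, overlap=200):
    """Split tokens into chunks of up to chunk_size items.

    Consecutive chunks share `overlap` tokens, so an item shorter than
    the overlap always appears whole in at least one chunk.
    """
    if not 0 <= overlap < chunk_size:
        raise ValueError("overlap must be >= 0 and smaller than chunk_size")
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(tokens), step):
        chunks.append(tokens[start:start + chunk_size])
        if start + chunk_size >= len(tokens):
            break  # the last chunk already reaches the end of the input
    return chunks
```

With the defaults, a 2,500-token document yields three chunks starting at tokens 0, 800, and 1,600.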
Geographic coverage
- Regions generally supported: US, EU (multiregion)
- Regions with limited accessibility: Germany, Singapore, UK, Canada, India, Australia