Document AI offers multiple products to extract information from documents for different use cases:
- Form Parser
-
Custom extractor, which offers three different modeling types:
- Foundation model
- Custom model based
- Custom template based
Form Parser
Form Parser extracts key-value pairs (KVP), tables, selection marks (checkboxes), and generic fields to augment and automate extraction. It can extract up to 11 generic entities and checkboxes out of the box. You don't specify the fields (schema), you want to extract with the Form Parser. The model detects and returns entities of interest from each page of documents.
Custom extractor
The custom extractor extracts entities you define in schema and offers three modeling options: foundation model, custom model based, and custom template based. Given promising results from foundation models with little to no training data, we recommend starting with the foundation model as the first option and try out other options as needed. The foundation models do zero- to few-shot prediction, based on up to 5 labeled documents in the dataset, and fine-tuned prediction with more than 10 labeled documents in the dataset.
Because foundation models typically require fewer training documents, they're recommended as the first option for all variable layouts.
Layout Parser
Layout Parser transforms documents in various formats into structured representations, making content like paragraphs, tables, lists, and structural elements like headings, page headers, and footers accessible, and creating context-aware chunks that facilitate information retrieval in a range of generative AI and discovery apps.

