{"useAdvancedPdfParsing":boolean,// parser"advancedParser":{object (AdvancedParser)},"layoutParser":{object (LayoutParser)},"llmParser":{object (LlmParser)}// Union type}
AdvancedParser
Specifies the advanced parsing for RagFiles.
Fields
useAdvancedPdfParsingboolean
Whether to use advanced PDF parsing.
JSON representation
{"useAdvancedPdfParsing":boolean}
LayoutParser
Document AI Layout Parser config.
Fields
processorNamestring
The full resource name of a Document AI processor or processor version. The processor must have typeLAYOUT_PARSER_PROCESSOR. If specified, theadditionalConfig.parse_as_scanned_pdffield must be false. Format: *projects/{projectId}/locations/{location}/processors/{processorId}*projects/{projectId}/locations/{location}/processors/{processorId}/processorVersions/{processor_version_id}
maxParsingRequestsPerMininteger
The maximum number of requests the job is allowed to make to the Document AI processor per minute. Consulthttps://cloud.google.com/document-ai/quotasand the Quota page for your project to set an appropriate value here. If unspecified, a default value of 120 QPM would be used.
globalMaxParsingRequestsPerMininteger
The maximum number of requests the job is allowed to make to the Document AI processor per minute in this project. Consulthttps://cloud.google.com/document-ai/quotasand the Quota page for your project to set an appropriate value here. If this value is not specified, maxParsingRequestsPerMin will be used by indexing pipeline as the global limit.
[[["Easy to understand","easyToUnderstand","thumb-up"],["Solved my problem","solvedMyProblem","thumb-up"],["Other","otherUp","thumb-up"]],[["Hard to understand","hardToUnderstand","thumb-down"],["Incorrect information or sample code","incorrectInformationOrSampleCode","thumb-down"],["Missing the information/samples I need","missingTheInformationSamplesINeed","thumb-down"],["Other","otherDown","thumb-down"]],["Last updated 2025-07-31 UTC."],[],[],null,["# RagFileParsingConfig\n\nSpecifies the parsing config for RagFiles.\nFields `useAdvancedPdfParsing` \n**(deprecated)** `boolean` \n| This item is deprecated!\n\nWhether to use advanced PDF parsing. \n`parser` `Union type` \nThe parser to use for RagFiles. `parser` can be only one of the following:\n`advancedParser` `object (`[AdvancedParser](/vertex-ai/docs/reference/rest/v1beta1/RagFileParsingConfig#AdvancedParser)`)` \nThe Advanced Parser to use for RagFiles.\n`layoutParser` `object (`[LayoutParser](/vertex-ai/docs/reference/rest/v1beta1/RagFileParsingConfig#LayoutParser)`)` \nThe Layout Parser to use for RagFiles.\n`llmParser` `object (`[LlmParser](/vertex-ai/docs/reference/rest/v1beta1/projects.locations.ragCorpora#LlmParser)`)` \nThe LLM Parser to use for RagFiles. \n\nAdvancedParser\n--------------\n\nSpecifies the advanced parsing for RagFiles.\nFields `useAdvancedPdfParsing` `boolean` \nWhether to use advanced PDF parsing. \n\nLayoutParser\n------------\n\nDocument AI Layout Parser config.\nFields `processorName` `string` \nThe full resource name of a Document AI processor or processor version. The processor must have type `LAYOUT_PARSER_PROCESSOR`. If specified, the `additionalConfig.parse_as_scanned_pdf` field must be false. Format: \\* `projects/{projectId}/locations/{location}/processors/{processorId}` \\* `projects/{projectId}/locations/{location}/processors/{processorId}/processorVersions/{processor_version_id}`\n`maxParsingRequestsPerMin` `integer` \nThe maximum number of requests the job is allowed to make to the Document AI processor per minute. Consult \u003chttps://cloud.google.com/document-ai/quotas\u003e and the Quota page for your project to set an appropriate value here. If unspecified, a default value of 120 QPM would be used.\n`globalMaxParsingRequestsPerMin` `integer` \nThe maximum number of requests the job is allowed to make to the Document AI processor per minute in this project. Consult \u003chttps://cloud.google.com/document-ai/quotas\u003e and the Quota page for your project to set an appropriate value here. If this value is not specified, maxParsingRequestsPerMin will be used by indexing pipeline as the global limit."]]