Reference documentation and code samples for the Google Cloud Discovery Engine V1 Client class DocumentProcessingConfig.
A singleton resource of DataStore. It is empty when the DataStore is created, in which case the digital parser is used by default. The first call to the DataStoreService.UpdateDocumentProcessingConfig method initializes the config.
Generated from protobuf message google.cloud.discoveryengine.v1.DocumentProcessingConfig
Namespace
Google \ Cloud \ DiscoveryEngine \ V1
Methods
__construct
Constructor.
data
array
Optional. Data for populating the Message object.
↳ name
string
The full resource name of the Document Processing Config. Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
↳ chunking_config
Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig\ChunkingConfig
Whether chunking mode is enabled.
↳ default_parsing_config
Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig\ParsingConfig
Configurations for the default Document parser. If not specified, it is configured as the default DigitalParsingConfig, and that default parsing config is applied to all file types for Document parsing.
↳ parsing_config_overrides
array| Google\Protobuf\Internal\MapField
Map from file type to override the default parsing configuration based on the file type. Supported keys:
* pdf: Override parsing config for PDF files; digital parsing, OCR parsing, or layout parsing is supported.
* html: Override parsing config for HTML files; only digital parsing and layout parsing are supported.
* docx: Override parsing config for DOCX files; only digital parsing and layout parsing are supported.
* pptx: Override parsing config for PPTX files; only digital parsing and layout parsing are supported.
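The constructor accepts the fields listed for `data` as an associative array. The following is a minimal sketch rather than an authoritative sample: it assumes the `google/cloud-discoveryengine` Composer package is installed with its autoloader at `vendor/autoload.php`, and that the nested `DigitalParsingConfig` and `OcrParsingConfig` classes live under `ParsingConfig` with `digital_parsing_config` and `ocr_parsing_config` keys (names taken from the protobuf definition, not from this page). The resource name is a made-up placeholder.

```php
<?php
// Sketch only: assumes the google/cloud-discoveryengine package and its autoloader.
require __DIR__ . '/vendor/autoload.php';

use Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig;
use Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig\ParsingConfig;
// Assumed nested class names (not shown on this page).
use Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig\ParsingConfig\DigitalParsingConfig;
use Google\Cloud\DiscoveryEngine\V1\DocumentProcessingConfig\ParsingConfig\OcrParsingConfig;

// Hypothetical resource name; replace the placeholder segments with real IDs.
$name = 'projects/my-project/locations/global/collections/default_collection'
    . '/dataStores/my-data-store/documentProcessingConfig';

$config = new DocumentProcessingConfig([
    'name' => $name,
    // Default parser applied to every file type without an override.
    'default_parsing_config' => new ParsingConfig([
        'digital_parsing_config' => new DigitalParsingConfig(),
    ]),
    // Per-file-type overrides, keyed by one of the supported keys.
    'parsing_config_overrides' => [
        'pdf' => new ParsingConfig([
            'ocr_parsing_config' => new OcrParsingConfig(),
        ]),
    ],
]);

echo $config->getName(), PHP_EOL;
echo $config->hasDefaultParsingConfig() ? 'default parser set' : 'default parser unset', PHP_EOL;

// The overrides come back as a map that can be iterated by file type.
foreach ($config->getParsingConfigOverrides() as $fileType => $parsingConfig) {
    echo 'override for: ', $fileType, PHP_EOL;
}
```

The same object can be built incrementally with `setName`, `setDefaultParsingConfig`, and `setParsingConfigOverrides`, each of which returns `$this`.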
getName
The full resource name of the Document Processing Config.
Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
string
setName
The full resource name of the Document Processing Config.
Format: projects/*/locations/*/collections/*/dataStores/*/documentProcessingConfig.
var
string
$this
getChunkingConfig
Whether chunking mode is enabled.
hasChunkingConfig
clearChunkingConfig
setChunkingConfig
Whether chunking mode is enabled.
$this
getDefaultParsingConfig
Configurations for default Document parser.
If not specified, it is configured as the default DigitalParsingConfig, and that default parsing config is applied to all file types for Document parsing.
hasDefaultParsingConfig
clearDefaultParsingConfig
setDefaultParsingConfig
Configurations for default Document parser.
If not specified, it is configured as the default DigitalParsingConfig, and that default parsing config is applied to all file types for Document parsing.
$this
getParsingConfigOverrides
Map from file type to override the default parsing configuration based on the file type. Supported keys:
- pdf: Override parsing config for PDF files; digital parsing, OCR parsing, or layout parsing is supported.
- html: Override parsing config for HTML files; only digital parsing and layout parsing are supported.
- docx: Override parsing config for DOCX files; only digital parsing and layout parsing are supported.
- pptx: Override parsing config for PPTX files; only digital parsing and layout parsing are supported.
setParsingConfigOverrides
Map from file type to override the default parsing configuration based on the file type. Supported keys:
- pdf: Override parsing config for PDF files; digital parsing, OCR parsing, or layout parsing is supported.
- html: Override parsing config for HTML files; only digital parsing and layout parsing are supported.
- docx: Override parsing config for DOCX files; only digital parsing and layout parsing are supported.
- pptx: Override parsing config for PPTX files; only digital parsing and layout parsing are supported.
$this
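Because each override key supports only certain parsers, it can help to check a planned set of overrides before building the message. The sketch below is plain PHP with no library dependency; `SUPPORTED_PARSERS`, `validateOverrides`, and the short labels `digital`, `ocr`, and `layout` are hypothetical names introduced for illustration, and the table simply mirrors the supported keys documented for `parsing_config_overrides`.

```php
<?php
// Hypothetical lookup table mirroring the documented supported keys.
const SUPPORTED_PARSERS = [
    'pdf'  => ['digital', 'ocr', 'layout'],
    'html' => ['digital', 'layout'],
    'docx' => ['digital', 'layout'],
    'pptx' => ['digital', 'layout'],
];

/**
 * Returns a list of problems found in a file-type => parser-kind map.
 * An empty list means every entry uses a supported key and parser.
 */
function validateOverrides(array $overrides): array
{
    $problems = [];
    foreach ($overrides as $fileType => $parser) {
        if (!isset(SUPPORTED_PARSERS[$fileType])) {
            $problems[] = "unsupported file type: $fileType";
        } elseif (!in_array($parser, SUPPORTED_PARSERS[$fileType], true)) {
            $problems[] = "$parser parsing is not supported for $fileType";
        }
    }
    return $problems;
}

// Example: one valid entry, one unsupported parser, one unsupported key.
print_r(validateOverrides(['pdf' => 'ocr', 'html' => 'ocr', 'txt' => 'digital']));
```

A check like this only reflects the documented key list; the service remains the authority on what it accepts.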