Reference documentation and code samples for the Google Cloud Document Ai V1 Client class OcrConfig.
Config for Document OCR.
Generated from protobuf message google.cloud.documentai.v1.OcrConfig
Namespace
Google \ Cloud \ DocumentAI \ V1Methods
__construct
Constructor.
data
array
Optional. Data for populating the Message object.
↳ hints
↳ enable_native_pdf_parsing
bool
Enables special handling for PDFs with existing text information. Results in better text extraction quality in such PDF inputs.
↳ enable_image_quality_scores
bool
Enables intelligent document quality scores after OCR. Can help with diagnosing why OCR responses are of poor quality for a given input. Adds additional latency comparable to regular OCR to the process call.
↳ advanced_ocr_options
array
A list of advanced OCR options to further fine-tune OCR behavior. Current valid values are: - legacy_layout
: a heuristics layout detection algorithm, which serves as an alternative to the current ML-based layout detection algorithm. Customers can choose the best suitable layout algorithm based on their situation.
↳ enable_symbol
bool
Includes symbol level OCR information if set to true.
↳ compute_style_info
bool
Turn on font identification model and return font style information. Deprecated, use PremiumFeatures.compute_style_info instead.
↳ disable_character_boxes_detection
bool
Turn off character box detector in OCR engine. Character box detection is enabled by default in OCR 2.0 (and later) processors.
↳ premium_features
getHints
Hints for the OCR model.
hasHints
clearHints
setHints
Hints for the OCR model.
$this
getEnableNativePdfParsing
Enables special handling for PDFs with existing text information. Results in better text extraction quality in such PDF inputs.
bool
setEnableNativePdfParsing
Enables special handling for PDFs with existing text information. Results in better text extraction quality in such PDF inputs.
var
bool
$this
getEnableImageQualityScores
Enables intelligent document quality scores after OCR. Can help with diagnosing why OCR responses are of poor quality for a given input.
Adds additional latency comparable to regular OCR to the process call.
bool
setEnableImageQualityScores
Enables intelligent document quality scores after OCR. Can help with diagnosing why OCR responses are of poor quality for a given input.
Adds additional latency comparable to regular OCR to the process call.
var
bool
$this
getAdvancedOcrOptions
A list of advanced OCR options to further fine-tune OCR behavior. Current valid values are:
-
legacy_layout
: a heuristics layout detection algorithm, which serves as an alternative to the current ML-based layout detection algorithm.
Customers can choose the best suitable layout algorithm based on their situation.
setAdvancedOcrOptions
A list of advanced OCR options to further fine-tune OCR behavior. Current valid values are:
-
legacy_layout
: a heuristics layout detection algorithm, which serves as an alternative to the current ML-based layout detection algorithm.
Customers can choose the best suitable layout algorithm based on their situation.
var
string[]
$this
getEnableSymbol
Includes symbol level OCR information if set to true.
bool
setEnableSymbol
Includes symbol level OCR information if set to true.
var
bool
$this
getComputeStyleInfo
Turn on font identification model and return font style information.
Deprecated, use PremiumFeatures.compute_style_info instead.
bool
setComputeStyleInfo
Turn on font identification model and return font style information.
Deprecated, use PremiumFeatures.compute_style_info instead.
var
bool
$this
getDisableCharacterBoxesDetection
Turn off character box detector in OCR engine. Character box detection is enabled by default in OCR 2.0 (and later) processors.
bool
setDisableCharacterBoxesDetection
Turn off character box detector in OCR engine. Character box detection is enabled by default in OCR 2.0 (and later) processors.
var
bool
$this
getPremiumFeatures
Configurations for premium OCR features.
hasPremiumFeatures
clearPremiumFeatures
setPremiumFeatures
Configurations for premium OCR features.
$this