Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

OCR Reader

Result Processor Grooper.Extract

Extracts text from a region near each Data Type output instance using OCR or existing OCR results.

Remarks

The OCR Reader result processor enables extraction of text from a specific region of a document, relative to each output instance, using either a configured OCR Profile or the document's existing full-page OCR results.

This processor is commonly used to extract values that are not directly available in the original OCR output, such as labels, values adjacent to anchors, or content in dynamically defined zones.

How It Works

  • For each output instance, the OCR Reader determines a region from which to extract text, based on the 'Region' and 'Relative To' properties.
  • If an OCR Profile is specified, OCR is performed on the region; otherwise, text is extracted from the document's existing OCR data.
  • Optional auto snap features can adjust the region to align with detected lines in the document.
  • A Value Extractor can be applied to the OCR results to further refine or filter the extracted value.
  • Output formatting options control how multiple values and line breaks are represented.

Configuration Guidance

  • Set the 'OCR Profile' property to use a specific OCR engine and settings for extraction, or leave it blank to use existing OCR data.
  • Use the 'Region' and 'Relative To' properties to define the extraction zone relative to each output instance.
  • Enable auto snap and configure 'Auto Snap Distance' and 'Auto Snap Margin' to align the region with document lines.
  • Use a Value Extractor to extract or validate specific patterns from the OCR results.
  • Adjust 'Value Separator' and 'Line Separator' to control output formatting for multi-value or multi-line results.

Usage Notes

  • The OCR Reader is ideal for extracting values that are not directly available in the main OCR output, such as values near anchors, labels, or in dynamic regions.
  • Proper configuration of the extraction region and OCR profile is essential for accurate results.
  • Use the 'Exclude Anchor' and 'Output Full Region' options to fine-tune the output and highlight regions during tuning.

Properties

NameTypeDescription

See Also

Used By

Notification