Grooper Help - Version 25.0
25.0.0017 2,127

Spell Corrector

OCR Repair Options Grooper.Core

Defines options for spell-correcting the text value of a Data Field.

Remarks

The SpellCorrector class provides configurable options for repairing and correcting OCR text in extracted data fields.

This object can be assigned to a Data Field's 'Spell Correction' property to automatically fix common OCR errors and apply custom spelling corrections.

Overview

SpellCorrector combines general OCR repair (such as fixing character case and ambiguous characters) with targeted spelling correction using a configurable Value Extractor.

  • The 'Correction Extractor' property allows you to define custom logic for identifying and correcting specific spelling errors in field values.

Usage

  1. Assign a Spell Corrector to a Data Field's 'Spell Correction' property.
  2. Configure the desired OCR repair options (for example, enable 'Repair Case' to fix case errors).
  3. Optionally, set 'Correction Extractor' to a Value Extractor that matches incorrect values and outputs the corrected text.

During extraction or validation, the SpellCorrector will:

  • Apply general OCR repairs to the field's value.
  • Use the 'Correction Extractor', if one is set, to find spelling errors and replace them with the correct values.
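
The two stages above can be sketched in Python to make the order of operations concrete. This is an illustration only, not Grooper's implementation or API: the names `repair_case` and `spell_correct` are hypothetical, and the case-repair rule (uppercase the whole value when most of its letters are already uppercase) is an assumption about what a case repair might do.

```python
from typing import Callable, Optional


def repair_case(value: str) -> str:
    """Hypothetical case repair: if most letters are uppercase,
    treat the value as all-caps and uppercase the stragglers."""
    letters = [c for c in value if c.isalpha()]
    if not letters:
        return value
    upper_count = sum(1 for c in letters if c.isupper())
    return value.upper() if upper_count * 2 > len(letters) else value


def spell_correct(
    value: str,
    fix_case: bool = True,
    correct: Optional[Callable[[str], str]] = None,
) -> str:
    """Run the general repair first, then the optional correction stage."""
    if fix_case:
        value = repair_case(value)
    if correct is not None:
        # Stand-in for the 'Correction Extractor' stage (assumed interface).
        value = correct(value)
    return value


if __name__ == "__main__":
    print(spell_correct("TOTAL DuE"))
    print(spell_correct("lnvoice", correct=lambda v: v.replace("lnvoice", "Invoice")))
```

The correction stage is passed in as a plain function here so that it stays optional, mirroring step 3 of the usage list.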

Example

  • Use a Pattern Match extractor as the 'Correction Extractor' to replace common OCR misreads (e.g., "lnvoice" → "Invoice").
  • Enable 'Repair Case' to ensure all-caps fields are corrected to uppercase.
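
As a rough illustration of the first bullet, the following Python sketch pairs regular-expression patterns for known misreads with their corrected output. It does not use Grooper's Pattern Match extractor or its configuration syntax. The rule table, the second rule ("Tota1"), and the function name `apply_corrections` are hypothetical.

```python
import re

# Each rule pairs a pattern for a known OCR misread with its corrected text.
# Word boundaries (\b) keep the rule from firing inside longer words.
RULES = [
    (re.compile(r"\blnvoice\b"), "Invoice"),  # lowercase L misread for capital I
    (re.compile(r"\bTota1\b"), "Total"),      # digit 1 misread for lowercase L
]


def apply_corrections(value: str, rules=RULES) -> str:
    """Apply every correction rule to the field value, in order."""
    for pattern, replacement in rules:
        value = pattern.sub(replacement, value)
    return value


if __name__ == "__main__":
    print(apply_corrections("lnvoice Tota1"))  # Invoice Total
```

Keeping the rules as data makes it easy to add a new misread without touching the matching logic.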

Properties

Name    Type    Description

See Also

Used By

Notification