Grooper Help - Version 25.0
25.0.0024 2,166

Labeled OMR

OMR Extractor Grooper.Extract

Reads a group of one or more checkboxes located nearby text labels.

Remarks

LabeledOMR extracts the state of checkboxes that are positioned near text labels, supporting both rectangular and circular checkboxes, and is Label Set-aware for automatic label detection.


Overview

LabeledOMR is designed for forms where checkboxes are associated with nearby text labels, such as survey questions, option lists, or grouped selections. It detects checkboxes by their proximity to labels, using either a configured label extractor or Label Sets defined on the parent Data Field. Both rectangular and circular checkboxes are supported.

Before using this extractor, ensure that source documents have been processed through the Box Removal command during the Recognize activity, or directly during Image Processing, to enable reliable detection of rectangular checkboxes.


Key Features

  • Label-Based Checkbox Detection:
    Associates checkboxes with nearby text labels, using a configurable label extractor or Label Sets.

  • Header Extraction:
    Optionally, use a header extractor to disambiguate between multiple groups of checkboxes on a page.

  • Group and Individual Label Matching:
    Supports both group-based and individual label matching, controlled by the 'Group Match' property.

  • Noise Filtering:
    The 'Maximum Noise' property limits the number of extraneous characters allowed in the label group region.

  • Line Consideration:
    The 'Consider Lines' property improves grouping accuracy when multiple identical labels appear on a form.

  • Label Set Awareness:
    If the Label Extractor or Header Extractor is left empty, Label Set labels are used automatically.


Usage Examples

  • Label Set Extraction:
    Leave the Label Extractor empty and define labels in the Label Set for the parent Data Field. LabeledOMR will automatically use these labels.

  • Custom Label Extraction:
    Configure a Pattern Match, List Match, or other extractor to match individual or grouped labels, and set the 'Group Match' and 'Minimum Label Count' properties as needed.

  • Header Disambiguation:
    Use a Header Extractor to select the correct group of checkboxes when multiple similar groups are present on a page.


Notes

  • This extractor is Label Set-aware, and will use Label Set labels automatically if the Label Extractor or Header Extractor properties are left empty.
  • For rectangular checkbox detection, ensure Box Removal is performed before extraction.
  • For more details, see the documentation for each property and the Labeled OMR wiki page.

Properties

NameTypeDescription
General
Options

Used By

Recommended Content

Notification