Grooper Help - Version 25.0
25.0.0024 2,166

Pattern-Based Separation

Extractor Based Provider Grooper.Capture

Creates a new document whenever the configured Value Extractor produces one or more results on a Batch Page.

Remarks

Overview

The Pattern-Based Separator class implements separation logic that creates a new document at each page where the specified Value Extractor finds a match. This approach is ideal for scenarios where a specific pattern, keyword, barcode, or other extracted value reliably indicates the start of a new document.


Purpose and Role

  • Pattern-Driven Separation:
    Automates document separation by detecting the presence of a configured pattern or value on each page.
  • Flexible Criteria:
    Supports a wide range of extraction strategies, including regular expressions, barcodes, OMR, and more, as defined by the Value Extractor.
  • Simple Configuration:
    Requires only the assignment of a Value Extractor to define separation points.

Usage Scenarios

  • Barcode or Label Sheets:
    Insert a barcode or label at the start of each document and configure the Value Extractor to detect it.
  • Keyword-Based Separation:
    Use a regular expression extractor to split documents at pages containing a specific keyword or phrase.
  • Form or Section Headers:
    Detect the start of new forms or sections based on unique headers or patterns.

Best Practices

  • Configure the Value Extractor to match only those patterns that reliably indicate a new document.
  • Test extraction logic on representative batches to ensure accurate separation and avoid false positives.
  • Combine with other separation providers or review steps for complex workflows.

Properties

NameTypeDescription
Scanning Support String

Indicates whether this provider can be used in real time when scanning.

See Also

Used By

Notification