Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Select Page

Value Extractor Grooper.Extract

Selects and outputs the full content of one or more pages from a document, based on page number and/or content criteria.

Remarks

The Select Page extractor enables you to extract the entire content of selected pages from a document as Data Instances.
This is useful for scenarios where you need to capture whole pages for downstream processing, validation, or archival.

Overview

Select Page provides flexible page selection using both page numbers and content-based criteria.
You can specify which pages to include using the 'Page Filter' property, and further refine selection by assigning a Value Extractor to the 'Extractor' property. The output can be configured to return each page separately or combine all selected pages into a single result.

Configuration Guidance

  • Use 'Page Filter' to target specific pages by number, range, or relative position.
  • Assign an 'Extractor' to filter pages by content, such as keywords, barcodes, or patterns.
  • Enable 'Combine' to merge all selected pages into one output instance, or leave disabled to process each page individually.

Usage Scenarios

  • Extract First and Last Page:
    Set 'Page Filter' to 1,-1 to select the first and last page.
  • Extract Pages with a Keyword:
    Assign a Pattern Match extractor to 'Extractor' to select pages containing a specific word or phrase.
  • Combine Multi-Page Sections:
    Enable 'Combine' to merge all selected pages into a single result for processing as a unit.

Diagnostics

  • Selection Log:
    Logs the page indices selected by 'Page Filter' and any content-based filtering.
  • Output Log:
    Indicates whether pages were combined or returned separately, and the number of output instances.

These artifacts are accessible via the diagnostic interface and can be used to validate and tune your Select Page configuration.

Properties

NameTypeDescription

Used By

Notification