Grooper Help - Version 25.0
25.0.0017 2,127

Flow

Layout Provider Grooper.Extract

Detects cases where array elements occur one after another in the text flow of a document.

Remarks

The Flow layout provider is used to extract arrays of values that appear sequentially within the text flow of a document, such as lists, tables without explicit grid lines, or repeated fields separated by whitespace or punctuation.

This provider determines whether two or more elements are part of the same array by analyzing their order and proximity in the text, as well as the characters or whitespace that separate them.

How It Works

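  • The Flow layout provider evaluates the text between each pair of consecutive candidate elements. It checks that text against a regular expression (see the 'Separator Expression' property), against character and geometric distance limits (see the 'Maximum Character Distance' and 'Maximum Geometric Distance' properties), or both.
  • If the text between two elements matches the configured separator pattern and the elements are within the allowed distance, they are considered part of the same array.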
  • Text preprocessing options (see the 'Preprocessing Options' property) can be applied to normalize whitespace, line breaks, and other control characters before extraction.
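The grouping rule described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not Grooper's implementation: the function group_by_flow, the (start, end) span representation, and the choice to require that the separator match the entire gap are all hypothetical, and geometric distance is left out.

```python
import re
from typing import List, Optional, Tuple

# Hypothetical representation: each candidate element is a (start, end)
# character span in the document text.
Span = Tuple[int, int]


def group_by_flow(text: str, spans: List[Span], separator: str,
                  max_char_distance: int) -> List[List[str]]:
    """Group candidate spans into arrays.

    Assumption: two neighbors belong together when the text between them
    is no longer than max_char_distance and matches `separator` in full.
    """
    sep = re.compile(separator)
    groups: List[List[str]] = []
    current: List[str] = []
    prev_end: Optional[int] = None
    for start, end in sorted(spans):
        if prev_end is not None:
            gap = text[prev_end:start]
            linked = (len(gap) <= max_char_distance
                      and sep.fullmatch(gap) is not None)
            if not linked:
                # The flow is broken, so close the current array.
                groups.append(current)
                current = []
        current.append(text[start:end])
        prev_end = end
    if current:
        groups.append(current)
    return groups


text = "Codes: A12, B34; C56 and later D78, E90"
# Candidate elements: one uppercase letter followed by two digits.
spans = [(m.start(), m.end()) for m in re.finditer(r"[A-Z]\d{2}", text)]
arrays = group_by_flow(text, spans, r",\s*|;\s*", max_char_distance=3)
print(arrays)  # [['A12', 'B34', 'C56'], ['D78', 'E90']]
```

In this sample the words " and later " do not match the separator pattern, so the run of values splits into two arrays at that point.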

Configuration Guidance

  • Use the 'Separator Expression' property to define what constitutes a valid separator between array elements. For example, to allow a comma or semicolon followed by optional whitespace, use ,\s*|;\s* (equivalently, [,;]\s*).
  • Adjust the 'Maximum Character Distance' and 'Maximum Geometric Distance' properties to control how far apart elements can be and still be considered part of the same array.
  • Enable or configure text preprocessing to handle documents with inconsistent formatting, line breaks, or large whitespace gaps.
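Before committing a separator pattern to the 'Separator Expression' property, it can help to try it against sample separator text. The following sketch uses Python's re module for that. The helper is_valid_separator is hypothetical, and whether Grooper anchors the pattern to the whole gap the way fullmatch does is an assumption.

```python
import re

# The example pattern from the guidance above: a comma or semicolon,
# each followed by optional whitespace.
SEPARATOR = re.compile(r",\s*|;\s*")


def is_valid_separator(between: str) -> bool:
    """Return True when the whole string between two elements matches."""
    return SEPARATOR.fullmatch(between) is not None


samples = [", ", ";", ";   ", " , ", " - ", ""]
results = {s: is_valid_separator(s) for s in samples}
for sample, ok in results.items():
    print(repr(sample), ok)
```

Under these assumptions, " , " is rejected because the pattern does not allow whitespace before the comma. A pattern such as \s*[,;]\s* would tolerate it.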

Usage Notes

  • The Flow layout provider is ideal for extracting repeated values from unstructured or semi-structured documents where array elements are not separated by explicit table structures.
  • Proper configuration of separator and distance properties is essential for accurate extraction, especially in documents with variable formatting.
  • For more information on preprocessing, see the documentation for Text Preprocessor.
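As a rough illustration of why preprocessing matters for separator matching, the sketch below normalizes line breaks, control characters, and whitespace runs before any separator test is applied. It is not the Text Preprocessor itself: the function normalize_text and the specific rules it applies are assumptions chosen for the example.

```python
import re


def normalize_text(text: str) -> str:
    """Hypothetical normalization pass applied before extraction."""
    # Unify Windows and old Mac line endings to "\n".
    text = text.replace("\r\n", "\n").replace("\r", "\n")
    # Drop control characters other than tab and newline.
    text = re.sub(r"[\x00-\x08\x0b\x0c\x0e-\x1f]", "", text)
    # Collapse runs of spaces and tabs into a single space.
    text = re.sub(r"[ \t]+", " ", text)
    # Replace line breaks, with any surrounding whitespace, by one space.
    text = re.sub(r"\s*\n\s*", " ", text)
    return text.strip()


raw = "A12,\r\n   B34;\t\tC56\x0c"
clean = normalize_text(raw)
print(clean)  # A12, B34; C56
```

After normalization, every gap in the sample is a comma or semicolon followed by a single space, so a simple pattern such as ,\s*|;\s* is enough to link the values.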

Properties

Name    Type    Description

See Also

Used By
