Grooper Help - Version 25.0
25.0.0017 2,127

ContextScopes

Grooper.Extract

Determines the scope of context feature extraction.

Remarks

Overview

The ContextScopes enumeration controls the method used to gather context features for each candidate value in a Field Class. The context scope determines the spatial or logical relationship between a value and its surrounding features, which are used for machine learning-based classification and extraction.

Choosing the right context scope is essential for accurately identifying the correct value among multiple candidates, especially in documents where similar values may appear in different locations or contexts.

Available Modes

  • Zonal: Extracts features from one or more rectangular regions (zones) defined relative to each value.
  • Flow: Extracts features that occur before and/or after the value in the text flow (reading order).
  • Self: Extracts features that are contained within the value itself.
  • Nearest: Extracts a limited number of features that are spatially closest to the value, with optional direction and distance filtering.
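
To make the four relationships concrete, here is a minimal Python sketch. It is not Grooper's API or implementation: the `ContextScope` enum, the `Token` type, the `gather_context` function, and its `zone`, `window`, and `k` parameters are all hypothetical names chosen for illustration, and real feature extraction is configured through the Field Class properties rather than code like this.

```python
from dataclasses import dataclass
from enum import Enum
from math import hypot


class ContextScope(Enum):
    """Hypothetical mirror of the four modes (not Grooper's own type)."""
    ZONAL = "Zonal"
    FLOW = "Flow"
    SELF = "Self"
    NEAREST = "Nearest"


@dataclass(frozen=True)
class Token:
    text: str
    x: float    # horizontal position on the page
    y: float    # vertical position on the page
    order: int  # position in reading order


def gather_context(scope, value, tokens, *,
                   zone=(-100.0, -10.0, 0.0, 10.0), window=2, k=2):
    """Return the texts of the context features for `value` under `scope`."""
    others = [t for t in tokens if t is not value]
    if scope is ContextScope.SELF:
        # Features come from inside the value itself.
        return value.text.lower().split()
    if scope is ContextScope.ZONAL:
        # `zone` is (left, top, right, bottom) offsets relative to the value.
        left, top, right, bottom = zone
        return [t.text for t in others
                if left <= t.x - value.x <= right
                and top <= t.y - value.y <= bottom]
    if scope is ContextScope.FLOW:
        # Tokens at most `window` positions before or after in reading order.
        return [t.text for t in sorted(others, key=lambda t: t.order)
                if 0 < abs(t.order - value.order) <= window]
    if scope is ContextScope.NEAREST:
        # The k spatially closest tokens, ranked by Euclidean distance.
        ranked = sorted(others,
                        key=lambda t: hypot(t.x - value.x, t.y - value.y))
        return [t.text for t in ranked[:k]]
    raise ValueError(f"unknown scope: {scope!r}")


# Illustrative page: a label to the left of the candidate value "$42.00".
tokens = [
    Token("Invoice", 50, 20, 0),
    Token("Total:", 300, 100, 1),
    Token("$42.00", 380, 100, 2),
    Token("Thank", 50, 140, 3),
    Token("you", 100, 140, 4),
]
value = tokens[2]
```

In this toy layout the zonal and nearest variants both pick up the "Total:" label beside the value, while the flow variant depends only on reading order and the self variant ignores the surrounding tokens entirely.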

Usage Guidance

  • Use Zonal for semi-structured documents where labels or key features consistently appear in fixed locations relative to the value (e.g., invoices).
  • Use Flow for natural language or unstructured documents where context is determined by surrounding words or phrases (e.g., contracts).
  • Use Self when the features that define a value are contained within it (e.g., paragraphs or clauses).
  • Use Nearest for flexible, proximity-based context extraction, especially when document layouts vary or context is not strictly zonal or flow-based.
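
The guidance above can be summarized as a simple lookup. The sketch below is only a rule-of-thumb helper written in Python for illustration; the `RECOMMENDED_SCOPE` table, its trait labels, and the `suggest_scope` function are hypothetical and are not Grooper settings or API calls.

```python
# Hypothetical trait labels mapped to the scope the guidance suggests.
RECOMMENDED_SCOPE = {
    "semi-structured": "Zonal",    # labels in fixed positions, e.g. invoices
    "natural-language": "Flow",    # context is surrounding words, e.g. contracts
    "self-describing": "Self",     # features live inside the value, e.g. clauses
    "variable-layout": "Nearest",  # layouts vary; proximity matters most
}


def suggest_scope(document_trait: str) -> str:
    """Return the suggested starting scope for an illustrative trait label."""
    try:
        return RECOMMENDED_SCOPE[document_trait]
    except KeyError:
        known = ", ".join(sorted(RECOMMENDED_SCOPE))
        raise ValueError(
            f"unknown trait {document_trait!r}; expected one of: {known}"
        ) from None
```

A suggestion from a table like this is only a starting point; the mode that actually works best still has to be confirmed by testing against real documents.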

The selected context scope affects which additional properties are visible and configurable, such as 'Context Zones', 'Max Instances', 'Geo Filter', 'Flow Filter', 'Max Distance', and 'GeoTag Features'.

> Tip:
> Experiment with different context scopes and review the extracted features in the 'Tester' tab to determine which mode best captures the distinguishing context for your use case.

For more information, see the documentation for Field Class, Value Extractor, and related context extraction properties.

Can be one of the following values:

Name    Value    Description

Used By

Notification