Field Class

Inherits From Extractor Node Namespace Grooper.Extract

A trainable binary classifier for locating specific information on a document using contextual features.

Remarks

The Field Class is a supervised machine learning object in Grooper, designed to identify the correct instance of a value among multiple candidates on a document by analyzing the surrounding context. Field Classes are ideal for documents where the same type of value (such as a date, name, or clause) may appear in multiple places, and the correct instance must be selected based on nearby words, phrases, or other features.

Overview

Field Classes are one of the three primary data extraction objects in Grooper, alongside Value Readers and Data Types.
They are created as children of a Project or the "Local Resources" folder of a Content Type.
Field Classes do not extract values on their own; they must be referenced by another object to participate in extraction.

Configuration

Field Classes use two Value Extractors:
- The 'Value Extractor' finds all possible candidate values (e.g., all dates on a page).
- The 'Feature Extractor' finds contextual features (e.g., words or phrases) near each candidate.
Additional properties control the context scope, feature zones, and classifier tuning.
The context can be defined by flow (text order), zones (spatial regions), or proximity (nearest features).

Training & Usage

After configuring extractors, use the Extractor Node - Tester tab to run extraction and view candidates and features.
Select the correct value in the results list and use the thumbs-up (positive) or thumbs-down (negative) buttons to train the classifier.
The classifier uses TF-IDF weighting to learn which features are most predictive of the correct value.
On future documents, the Field Class will score candidates based on how closely their context matches the training data.

Best Practices

Use Field Classes when the correct value cannot be determined by position alone, such as in contracts, legal documents, or unstructured text.
Provide diverse positive and negative training examples to improve accuracy.
Adjust context scope and feature extraction settings to best capture the distinguishing context for your use case.

Properties

Name Type Description

General

Value Type

Storage Type

►

Specifies the Storage Type for values output by this extractor.

Can be one of the following types:

Value	Description
Int64	Represents a 64-bit signed integer value for use in Data Fields and related objects.
Int32	Represents a 32-bit signed integer value for use in Data Fields and related objects.
GUID	Represents a globally unique identifier (GUID) for use in Data Fields and related objects.
Custom	Defines a custom value type, along with the logic for parsing and formatting values..
URL	Represents a Uniform Resource Locator (URL) for use in Data Fields and related objects.
String	Represents a text value for use in Data Fields and related objects.
Int16	Represents a 16-bit signed integer value for use in Data Fields and related objects.
Double	Represents a 64-bit floating-point (double) value for use in Data Fields and related objects.
Decimal	Represents a decimal (floating-point) numeric value for use in Data Fields and related objects.
DateTime	Represents a date and/or time value for use in Data Fields and related objects.
Boolean	Represents a Boolean (true or false) value for use in Data Fields and related objects.

The Value Type property determines the data type and formatting rules for values produced by this Field Class.

Choose a Storage Type that matches the kind of data you expect to extract (such as text, number, date, or boolean).
Each storage type exposes its own set of configuration options, accessible by expanding the property in the property grid.
Any extracted value that cannot be converted to the selected Storage Type will be discarded.

Typical Usage

Use a date storage type to ensure only valid dates are output.
Use a numeric storage type to restrict results to numbers and apply formatting or validation.
Use a text storage type for general string values, with optional delimiters for multi-value fields.

> Tip:
> Setting the correct value type helps ensure data quality and consistency in downstream processing.

Value Extractor

►

Specifies the Value Extractor used to find candidate values for this field.

Can be one of the following types:

Value	Description
Reference	Delegates extraction to another configured extractor, enabling reuse and centralization of extraction logic.
AI Column Extractor	Extracts structured content from documents with two-column layouts.
AI Schema Extractor	Extracts structured data from documents using a large language model (LLM) guided by a user-defined JSON schema.
Ask AI	Executes a completion using a large language model (LLM) and returns one hit for each choice in the response.
Detect Signature	Detects a signature within a specified region of a document page by measuring the percentage of the area that is filled.
Entity Recognition	Identifies and categorizes entities such as people, organizations, locations, and quantities in unstructured text.
Field Match	Matches the value stored in a previously-extracted field or table column.
Find Barcode	Searches for barcode values in document Layout Data previously detected during image processing.
Highlight Zone	Defines a region of a document to be visually highlighted, without extracting any data values.
Key Phrase Extraction	Identifies key concepts and topics in text using Azure AI Language key phrase extraction.
Label Match	Matches a list of one or more label values, using matching options defined by a Labeling Behavior.
Labeled OMR	Reads a group of one or more checkboxes located nearby text labels.
Labeled Value	Extracts a field presented as a label-value pair within a document, associating labels and values based on their spatial relationship.
List Match	Extracts values from document text that match any entry in a list of search terms.
Ordered OMR	Reads one or more checkboxes with a consistent order of appearance inside a rectangular region.
Pattern Match	Extracts values from document text that match a specified regular expression pattern.
Pii Entity Recognition	Identifies, categorizes, and redacts sensitive information (PII) in unstructured text using Azure AI Language Services.
Query HTML	Extracts values from an HTML document using a CSS or XPath selector.
Query XML	Extracts values from XML documents using XPATH queries, enabling structured data extraction from XML content in Grooper.
Read Barcode	Extracts barcode values from document images using configurable barcode recognition.
Read Metadata	Reads a metadata value from a document by accessing a property on an attachment or content link.
Read Zone	Extracts text content from a specified rectangular region (zone) of a document.
Select Page	Selects and outputs the full content of one or more pages from a document, based on page number and/or content criteria.
Word Match	Extracts individual words and multi-word phrases (N-grams) from document text for use in classification, data extraction, and normalization.
Zonal OMR	Reads one or more checkboxes using manually-configured zones.

The Value Extractor property defines the extraction logic for locating all possible values of interest on a document.

Select an extractor type (such as pattern match, list match, OMR, barcode, or zonal extraction) to match your data and document structure.
After selecting a type, configure its settings using the dynamic panel or property grid.
The extractor should be set up to return all possible candidates for the value you want to classify (for example, all dates on a page).

Extractor Type Categories

Text Parsing Extractors:
Use regular expressions or value lists to extract text-based values.
OMR Extractors:
Extract values from checkboxes or bubbles using optical mark recognition.
Barcode Extractors:
Read values encoded in barcodes.
Zonal Extractors:
Extract text from a defined region of the page.
Reference Extractor:
Reuse extraction logic from another Value Reader, Data Type, or Field Class.

> Note:
> Once the extractor type is changed from 'none', the property must be reset before another type can be selected.

Best Practices

Configure the extractor to return all possible candidates, even if some are not correct; the classifier will select the best match based on context.
Use the 'Tester' tab to validate and refine your extractor configuration before training the classifier.

Feature Extractor

Value Extractor

►

Specifies the Value Extractor used to find contextual features for training and classification.

Can be one of the following types:

Value	Description
Reference	Delegates extraction to another configured extractor, enabling reuse and centralization of extraction logic.
AI Column Extractor	Extracts structured content from documents with two-column layouts.
AI Schema Extractor	Extracts structured data from documents using a large language model (LLM) guided by a user-defined JSON schema.
Ask AI	Executes a completion using a large language model (LLM) and returns one hit for each choice in the response.
Detect Signature	Detects a signature within a specified region of a document page by measuring the percentage of the area that is filled.
Entity Recognition	Identifies and categorizes entities such as people, organizations, locations, and quantities in unstructured text.
Field Match	Matches the value stored in a previously-extracted field or table column.
Find Barcode	Searches for barcode values in document Layout Data previously detected during image processing.
Highlight Zone	Defines a region of a document to be visually highlighted, without extracting any data values.
Key Phrase Extraction	Identifies key concepts and topics in text using Azure AI Language key phrase extraction.
Label Match	Matches a list of one or more label values, using matching options defined by a Labeling Behavior.
Labeled OMR	Reads a group of one or more checkboxes located nearby text labels.
Labeled Value	Extracts a field presented as a label-value pair within a document, associating labels and values based on their spatial relationship.
List Match	Extracts values from document text that match any entry in a list of search terms.
Ordered OMR	Reads one or more checkboxes with a consistent order of appearance inside a rectangular region.
Pattern Match	Extracts values from document text that match a specified regular expression pattern.
Pii Entity Recognition	Identifies, categorizes, and redacts sensitive information (PII) in unstructured text using Azure AI Language Services.
Query HTML	Extracts values from an HTML document using a CSS or XPath selector.
Query XML	Extracts values from XML documents using XPATH queries, enabling structured data extraction from XML content in Grooper.
Read Barcode	Extracts barcode values from document images using configurable barcode recognition.
Read Metadata	Reads a metadata value from a document by accessing a property on an attachment or content link.
Read Zone	Extracts text content from a specified rectangular region (zone) of a document.
Select Page	Selects and outputs the full content of one or more pages from a document, based on page number and/or content criteria.
Word Match	Extracts individual words and multi-word phrases (N-grams) from document text for use in classification, data extraction, and normalization.
Zonal OMR	Reads one or more checkboxes using manually-configured zones.

The Feature Extractor property defines how contextual features (such as words, phrases, or labels) are identified near each candidate value found by the 'Value Extractor'. These features are used to train the classifier and to score candidates during extraction.

Select an extractor type that returns the kind of features most useful for distinguishing the correct value (for example, general words, labels, or specific patterns).
Configure the extractor to return features that appear in the context of the value you want to classify.
The context in which features are collected is determined by the 'Context Scope' property (e.g., flow, zonal, nearest, or self).

Usage Example

For a contract where you want to identify the "Effective Date", configure the feature extractor to return general words (excluding stop words). During training, features near the correct date (such as "lease", "entered", "between") will be positively weighted, while features near incorrect dates (such as "clerk", "filed", "archives") will be negatively weighted.

Best Practices

Choose a feature extractor that best captures the distinguishing context for your use case.
Use the 'Tester' tab to review which features are being returned and refine the extractor as needed.
Remember that the scope and number of features included are controlled by properties such as 'Context Scope' and 'Max Instances'.
Use a Lexicon of stop words to filter out words which are not helpful in classification (of, the, etc.)

> Note:
> Once the extractor type is changed from 'none', the property must be reset before another type can be selected.

Description

String

►

Specifies a description for the item.

Context Scope Options

Context Scope

ContextScopes

►

Determines the scope of context feature extraction.

Can be one of the following values:

Name	Value	Description
Zonal	0	Extracts context features from one or more zones (rectangular regions) defined relative to each value. Use Zonal when context features (such as labels or keywords) consistently appear in specific spatial regions near the value. This is common in semi-structured documents like invoices, forms, or tables, where labels are to the left, above, or in other predictable positions relative to the value. Define one or more 'Context Zones' to specify the regions to search for features. Optionally use an 'Exclusion Extractor' to filter out values near unwanted features. Works best when document layouts are consistent and features are spatially aligned with values. Example: On an invoice, extract the value for "Invoice Date" by defining a zone to the left of the date value to capture the label. > Note: > This mode is often replaced by a "Labeled Value" Value Extractor for ease of setup, but may be useful for advanced scenarios.	►
Flow	1	Extracts context features that occur before and/or after the value in the text flow (reading order). Use Flow when the context that distinguishes a value is found in the words or phrases that appear before or after it in the document's reading order. This is ideal for unstructured or natural language documents, such as contracts or correspondence. Use the 'Flow Filter' property to include features before, after, or on either side of the value. Limit the number of features with 'Max Instances'. Enable 'GeoTag Features' to tag features as before `(b)` or after `(a)`. Example: In a contract, extract the "Effective Date" by including features like "effective", "as of", or "between" that appear before the date. > Tip: > Use this mode when context is determined by language flow rather than spatial layout.	►
Self	2	Extracts context features that are contained within the value itself. Use Self when the features that define a value are inside the value's boundaries, rather than around it. This is useful for extracting and classifying paragraphs, clauses, or other self-contained text blocks. No additional context properties are required for this mode. All features found within the value are used for training and classification. Example: Identify a non-compete clause in a contract by extracting paragraphs and using the words within each paragraph as features. > Note: > This mode is best for scenarios where the value's internal content is the primary distinguishing factor.	►
Nearest	3	Extracts a limited number of features that are spatially closest to the value, with optional direction and distance filtering. Use Nearest for flexible, proximity-based context extraction. This mode selects the closest features to each value, regardless of strict zones or flow order, and allows filtering by geometric direction and distance. Use 'Geo Filter' to include features from specific directions (e.g., above, left). Limit the number of features with 'Max Instances'. Restrict the search radius with 'Max Distance'. Enable 'GeoTag Features' to tag features with their direction (e.g., `(n)` for north/above). Example: On a variable-layout form, extract the value for "Total" by including the nearest features above or to the left, regardless of exact position. > Tip: > This mode is ideal when document layouts are inconsistent or when context is best defined by proximity.	►

Overview

The ContextScopes enumeration controls the method used to gather context features for each candidate value in a Field Class. The context scope determines the spatial or logical relationship between a value and its surrounding features, which are used for machine learning-based classification and extraction.

Choosing the right context scope is essential for accurately identifying the correct value among multiple candidates, especially in documents where similar values may appear in different locations or contexts.

Available Modes

Zonal: Extracts features from one or more rectangular regions (zones) defined relative to each value.
Flow: Extracts features that occur before and/or after the value in the text flow (reading order).
Self: Extracts features that are contained within the value itself.
Nearest: Extracts a limited number of features that are spatially closest to the value, with optional direction and distance filtering.

Usage Guidance

Use Zonal for semi-structured documents where labels or key features consistently appear in fixed locations relative to the value (e.g., invoices).
Use Flow for natural language or unstructured documents where context is determined by surrounding words or phrases (e.g., contracts).
Use Self when the features that define a value are contained within it (e.g., paragraphs or clauses).
Use Nearest for flexible, proximity-based context extraction, especially when document layouts vary or context is not strictly zonal or flow-based.

The selected context scope affects which additional properties are visible and configurable, such as 'Context Zones', 'Max Instances', 'Geo Filter', 'Flow Filter', 'Max Distance', and 'GeoTag Features'.

> Tip:
> Experiment with different context scopes and review the extracted features in the 'Tester' tab to determine which mode best captures > the distinguishing context for your use case.

For more information, see the documentation for Field Class, Value Extractor, and related context extraction properties.

Max Instances

Int32

►

Specifies the maximum number of context features to include for each candidate value.

Direction Filter

CompassDirection

►

When using a Context Scope of Nearest, limits the context features included based on their geometric position relative to the data value.

A combination of the following flags:

Name	Value	Description
North	1	Above the data value. Values on a previous page are also considered north.
South	2	Below the data value. Values on a subsequent page are also considered south.
West	4	Left of the data value.
East	8	Right of the data value.
NorthWest	16	Above and left of the data value.
NorthEast	32	Above and right of the data value.
SouthWest	64	Below and left of the data value.
SouthEast	128	Below and right of the data value.
All	255	All Directions

Overview

The CompassDirection enumeration is used in Field Class objects to filter context features by their geometric direction relative to a candidate value. This is only applicable when the 'Context Scope' property is set to Nearest. It enables fine-grained spatial filtering, allowing you to include or exclude features based on their position (such as above, below, left, right, or diagonal) with respect to the value being classified.

Multiple directions can be combined using bitwise flags. For example, to include only features above or to the left of a value, use:

The special value All includes features from all directions.

Usage Examples

To focus on likely label positions, include only features above (North) or to the left (West) of a value.
To reduce noise, exclude features in certain directions (for example, ignore features below a value).
Combine with the 'Max Distance' property to restrict the spatial region considered for context features.

Special Behaviors

Features on a previous page are treated as north; features on a subsequent page are treated as south.
When 'GeoTag Features' is enabled, features are tagged with their direction (for example, (n) for north).

Related Properties

'Geo Filter': Specifies which directions to include when extracting context features in Nearest mode.
'Max Distance': Limits the maximum distance (in inches) for included features.
'GeoTag Features': When enabled, appends a direction tag to each feature.

For more information, see the documentation for Field Class, 'Context Scope', and related extraction settings.

Position Filter

FlowDirection

►

Specifies the relative position of context features in the text flow with respect to a candidate value.

Can be one of the following values:

Name	Value	Description
Before	1	Occurs before the data value in the text flow. Use `Before` to include context features that appear earlier in the text flow than the candidate value. This is useful for capturing labels, headings, or introductory phrases that help identify the meaning or type of the value. For example, in the phrase: the word "Invoice Date" would be a feature occurring before the value "01/01/2024". When 'GeoTag Features' is enabled, features before the value are tagged with `(b)`.	►
After	2	Occurs after the data value in the text flow. Use `After` to include context features that appear later in the text flow than the candidate value. This is useful for capturing units, qualifiers, or trailing context that may clarify or modify the value. For example, in the phrase: the word "USD" would be a feature occurring after the value "12345". When 'GeoTag Features' is enabled, features after the value are tagged with `(a)`.	►
Either	3	Occurs before or after the data value in the text flow. Use `Either` to include context features that appear both before and after the candidate value in the text flow. This is the most inclusive option and is useful when relevant context may appear on either side of the value. For example, in the phrase: both "Invoice Date" (before) and "(Effective)" (after) would be included as context features. When 'GeoTag Features' is enabled, features are tagged with `(b)` or `(a)` as appropriate.	►

Overview

The FlowDirection enumeration is used in Field Class objects to filter context features by their position in the text flow relative to a candidate value. This is only applicable when the 'Context Scope' property is set to Flow. It enables filtering of features that occur before, after, or on either side of the value in the document's reading order.

This is useful for natural language documents, such as contracts or unstructured text, where the meaning of a value is often determined by the words or phrases that appear before or after it in the text.

Multiple directions can be combined using bitwise flags. For example, to include features both before and after a value, use:

Usage Examples

To focus on features that precede a value (such as labels or introductory phrases), use Before.
To focus on features that follow a value (such as units, qualifiers, or trailing context), use After.
To include features on both sides, use Either.

Special Behaviors

When 'GeoTag Features' is enabled, features are tagged with (b) for before and (a) for after.
The 'Max Instances' property limits the number of features included from the selected directions.

Related Properties

'Flow Filter': Specifies which directions to include when extracting context features in Flow mode.
'Max Instances': Limits the maximum number of features included.
'GeoTag Features': When enabled, appends a direction tag to each feature.

For more information, see the documentation for Field Class, 'Context Scope', and related extraction settings.

Context Zones

Rectangle[]

►

Defines rectangular zones, relative to each candidate value, from which context features will be extracted.

Exclusion Extractor

Value Extractor

►

An optional extractor that excludes candidate values if specific features are found in their context zones.

Can be one of the following types:

Value	Description
Reference	Delegates extraction to another configured extractor, enabling reuse and centralization of extraction logic.
AI Column Extractor	Extracts structured content from documents with two-column layouts.
AI Schema Extractor	Extracts structured data from documents using a large language model (LLM) guided by a user-defined JSON schema.
Ask AI	Executes a completion using a large language model (LLM) and returns one hit for each choice in the response.
Detect Signature	Detects a signature within a specified region of a document page by measuring the percentage of the area that is filled.
Entity Recognition	Identifies and categorizes entities such as people, organizations, locations, and quantities in unstructured text.
Field Match	Matches the value stored in a previously-extracted field or table column.
Find Barcode	Searches for barcode values in document Layout Data previously detected during image processing.
Highlight Zone	Defines a region of a document to be visually highlighted, without extracting any data values.
Key Phrase Extraction	Identifies key concepts and topics in text using Azure AI Language key phrase extraction.
Label Match	Matches a list of one or more label values, using matching options defined by a Labeling Behavior.
Labeled OMR	Reads a group of one or more checkboxes located nearby text labels.
Labeled Value	Extracts a field presented as a label-value pair within a document, associating labels and values based on their spatial relationship.
List Match	Extracts values from document text that match any entry in a list of search terms.
Ordered OMR	Reads one or more checkboxes with a consistent order of appearance inside a rectangular region.
Pattern Match	Extracts values from document text that match a specified regular expression pattern.
Pii Entity Recognition	Identifies, categorizes, and redacts sensitive information (PII) in unstructured text using Azure AI Language Services.
Query HTML	Extracts values from an HTML document using a CSS or XPath selector.
Query XML	Extracts values from XML documents using XPATH queries, enabling structured data extraction from XML content in Grooper.
Read Barcode	Extracts barcode values from document images using configurable barcode recognition.
Read Metadata	Reads a metadata value from a document by accessing a property on an attachment or content link.
Read Zone	Extracts text content from a specified rectangular region (zone) of a document.
Select Page	Selects and outputs the full content of one or more pages from a document, based on page number and/or content criteria.
Word Match	Extracts individual words and multi-word phrases (N-grams) from document text for use in classification, data extraction, and normalization.
Zonal OMR	Reads one or more checkboxes using manually-configured zones.

The Exclusion Extractor property allows you to specify a Value Extractor that identifies features which, if present within any context zone of a candidate value, will cause that value to be excluded from the results.

Use this to filter out values that are near unwanted or disqualifying features (such as "void", "sample", or other exclusionary terms).
The exclusion extractor is run for each candidate value, and if any matches are found within the defined context zones, the candidate is removed from consideration.

Usage Example

To prevent extraction of values labeled as "void", configure an exclusion extractor that matches the word "void" and add a context zone covering the label area.

> Note:
> This property is only applicable when 'Context Scope' is set to Zonal.

Maximum Distance

Double

►

Specifies the maximum spatial distance (in inches) between a context feature and a candidate value when using nearest context mode.

GeoTag Features

Boolean

►

If enabled, features will be tagged with suffixes indicating their position relative to the candidate value.

Classifier Tuning

Minimum Feature Count

Int32

►

Specifies the minimum number of context features required for a candidate value to be considered valid for training or output.

Training Threshold

Double

►

Determines the similarity threshold for negative instance training during classifier learning.

Use Class Frequency

Boolean

►

When enabled, the classifier incorporates class frequency (CF) into feature weighting, modifying the standard TF-IDF approach.

Sublinear TF Scaling

Boolean

►

When enabled, term frequency (TF) values are scaled logarithmically in the classifier's feature weighting.

Smooth IDF

Boolean

►

Controls whether smoothing is applied to the inverse document frequency (IDF) calculation in the classifier.

Output

Minimum Confidence

Double

►

Specifies the minimum confidence score required for a candidate value to be included in the output.

Collation Method

CollationType

►

Controls how results from individual extractors are processed into a final output result.

Can be one of the following values:

Name	Value	Description
Individual	0	All qualifying results will be returned individually. Each qualifying value is output as a separate result. Use this setting when you want to capture all possible matches, such as listing all dates or names found in a document. This is the default behavior for most Field Class extractions. > Example: > If three dates are found, three separate results will be output.	►
Combine	1	All qualifying results will be combined into a single instance. All qualifying values are concatenated into a single output, using the specified 'Separator' string. Use this setting to join multiple values into a single field, such as combining address lines or listing multiple codes. The 'Separator' property controls the delimiter between values (e.g., comma, space, or newline). The 'Maximum Page Distance' property can be used to limit which values are combined based on their page location. > Example: > If three names are found and the separator is ", ", the output will be "Alice, Bob, Carol".	►
Boolean	2	If any qualifying results are found, returns "True". Otherwise, returns "False". Returns a Boolean value indicating the presence or absence of any qualifying result. Use this setting for Yes/No or presence/absence fields, such as "Has Signature" or "Contains Clause". If at least one qualifying value is found, the output is `"True"`. If none are found, the output is `"False"`. > Example: > If any matching value is found, output is "True". If none are found, output is "False".	►

The Collation Type determines how multiple qualifying results from a Field Class are presented in the output.

> Tip:
> The available collation method may affect which additional properties are visible, such as 'Separator' or 'Maximum Page Distance'.

Separator

String

►

The string used to separate values when combining multiple instances into a single output.

Maximum Page Distance

Int32

►

Specifies the maximum allowed page distance between instances to be combined into a single output.

Order By

SortOrder

►

Defines the sort order of the result set.

Can be one of the following values:

Name	Value	Description
Position	0	Results are ordered by their position within the content flow. Sorts by page index, then by character index within the page. Use this to maintain the natural reading or extraction order as it appears in the document.	►
Frequency	1	Results are ordered by the number of occurrences of each distinct value. Sorts by the frequency with which each value appears in the collection (most frequent first by default). Useful for grouping or prioritizing repeated values, such as common labels or recurring data.	►
Confidence	2	Results are ordered by confidence. Sorts by the confidence score for each result (highest first by default). Use this to prioritize the most likely or reliable extraction results.	►
Extractor	3	Results are ordered by the extractor which produced each match. Sorts by the originating extractor's order in the configuration. Useful when combining results from multiple extractors and you want to preserve or group by extractor source.	►
Length	4	Results are ordered by the length of the value. Sorts by the number of characters in each value (longest or shortest first). Useful for preferring more specific or more general matches, such as longer phrases or shorter keywords.	►
Value	5	Results are ordered by value. Sorts alphabetically or numerically by the value content, using the configured Storage Type if available. Use this for dictionary order, numeric order, or to group similar values together.	►
CoordinateX	6	Results are ordered by position along the X axis. Sorts by the left coordinate of each result's bounding box, then by page index. Useful for left-to-right ordering on the page, such as columns or tabular data.	►
CoordinateY	7	Results are ordered by position along the Y axis. Sorts by the top coordinate of each result's bounding box, then by page index. Useful for top-to-bottom ordering on the page, such as rows or vertical lists.	►

The Sort Order enumeration specifies how a collection of Data Instance results will be sorted for output, display, or further processing.

Sorting can be based on position, frequency, confidence, extractor, value length, value content, or geometric coordinates.

Direction

SortDirection

►

Controls whether the output is sorted in ascending or descending order.

Can be one of the following values:

Name	Value	Description
Ascending	0	Results are returned in ascending order, where smaller values appear before larger values.
Descending	1	Results are returned in descending order, where larger values appear before smaller values.

Design Tabs

General	View or edit properties of a node.
Reports	View reports for a node.
Tester	Test an Extractor Node on documents in a test batch.
Weightings	View the classification weightings associated with this Field Class.
Advanced	View or edit advanced details about a node.

Context Menu Commands

Command	Shortcut	Description
bolt Purge Training		Deletes all training data from this Field Class.

Child Types

Data Type Value Reader

Used By

Reference Render

Field Class

Remarks

Overview

Configuration

Training & Usage

Best Practices

Properties

Typical Usage

Extractor Type Categories

Best Practices

Usage Example

Best Practices

Overview

Available Modes

Usage Guidance

Usage Example

Overview

Usage Examples

Special Behaviors

Related Properties

Overview

Usage Examples

Special Behaviors

Related Properties

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Usage Example

Design Tabs

Context Menu Commands

Child Types

See Also

Used By