Grooper Help - Version 25.0
25.0.0017 2,127

Labeling Behavior

Behavior Grooper.Extract

Enables Label Set-based extraction and configures label matching parameters for this Content Type and its descendants.

Remarks

The Labeling Behavior activates label-driven extraction for a Content Type, allowing you to define and use Label Sets for data extraction and classification.

What It Does

When to Use

  • Use this behavior when extracting data from semi-structured documents where field locations and labels may vary between document types, but the data model remains consistent.
  • Well suited to onboarding new document types quickly: create a new Label Set for each type that maps its document labels to Data Elements.

How It Works

  • After adding Labeling Behavior to a Content Type, refresh the Design Page to access the Labels tab.
  • Define one or more Label Sets, associating document text labels with Data Fields, Data Sections, and Data Tables in the Data Model.
  • Configure label and header similarity thresholds to control how closely a document label must match a defined label for extraction to occur.
  • Use the provided options to fine-tune fuzzy matching and wrapping behavior for complex layouts.
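
To make the idea of a similarity threshold concrete, here is a minimal Python sketch. It is not Grooper's matching algorithm, which this page does not describe; it uses the standard library's difflib ratio as a stand-in, and the function names and the 0.9 default are assumptions chosen for illustration only.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0..1 similarity ratio between two labels.

    Case and surrounding whitespace are ignored. difflib's ratio is an
    illustrative stand-in, not the measure Grooper uses.
    """
    return SequenceMatcher(None, a.strip().lower(), b.strip().lower()).ratio()


def label_matches(document_text: str, defined_label: str, threshold: float = 0.9) -> bool:
    """True when the document text is at least `threshold` similar to the defined label.

    `threshold` is a hypothetical parameter standing in for a label
    similarity setting.
    """
    return similarity(document_text, defined_label) >= threshold


# An OCR-damaged label passes a looser threshold but fails a stricter one.
print(label_matches("lnvoice Nurnber", "Invoice Number", threshold=0.8))
print(label_matches("lnvoice Nurnber", "Invoice Number", threshold=0.9))
```

In this sketch a lower threshold tolerates more OCR noise at the cost of more false matches, which is the trade-off a similarity threshold controls.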

Example Scenario

Suppose you have invoices from multiple vendors, each using different terminology for the same field (e.g., "Invoice Number", "Inv #", "Bill No."). By enabling the Labeling Behavior and defining a Label Set for each vendor, you can map every variation to a single Data Field. Supporting a new vendor then means adding a Label Set rather than changing the Data Model.
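
The scenario can be pictured as a lookup from each vendor's label text to one shared Data Field. The Python sketch below is only a conceptual model: the vendor names, the LABEL_SETS dictionary, and the field_for_label function are hypothetical and do not reflect how Grooper stores or evaluates Label Sets.

```python
from typing import Optional

# Hypothetical Label Sets: one per vendor, each mapping the shared Data Field
# name to the label text that vendor prints on its invoices.
LABEL_SETS = {
    "Vendor A": {"Invoice Number": "Invoice Number"},
    "Vendor B": {"Invoice Number": "Inv #"},
    "Vendor C": {"Invoice Number": "Bill No."},
}


def field_for_label(vendor: str, document_label: str) -> Optional[str]:
    """Return the Data Field a vendor's label maps to, or None if it is not defined."""
    wanted = document_label.strip().casefold()
    for field_name, label in LABEL_SETS.get(vendor, {}).items():
        if label.casefold() == wanted:
            return field_name
    return None


# All three vendor-specific labels resolve to the same Data Field.
for vendor, label in [("Vendor A", "Invoice Number"), ("Vendor B", "Inv #"), ("Vendor C", "Bill No.")]:
    print(vendor, "->", field_for_label(vendor, label))
```

The Data Field name stays fixed while only the per-vendor label text changes, which is why the data model can remain consistent across document types.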

Related Concepts

Properties

Name    Type    Description

See Also
