Grooper Help - Version 25.0
25.0.0017 2,127

Labeled Region

Quoting Method Grooper.GPT

Selects content occurring after a header label.

Remarks

The LabeledRegion quoting method extracts a region of content from a document that appears after a header label, optionally ending at a footer label or after a specified number of lines. This method is ideal for quoting tables, sections, or other structured regions that are demarcated by recognizable headers and/or footers.

How It Works:

  • The method locates header labels using either a Value Extractor or a Label Set associated with the Content Type.
  • Optionally, a footer label can be specified to mark the end of the region.
  • The number of lines to include after the header can be limited using the 'Maximum Lines' property.
  • Preprocessing options such as tab or paragraph marking can be applied to the extracted region to improve AI understanding of tabular or natural language content.
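
The selection steps above can be sketched in Python. This is only an illustration of the idea, not Grooper's implementation: the function name `labeled_region`, its parameters, and the use of regular expressions in place of a Value Extractor or Label Set are all assumptions. The sketch also assumes the header and footer lines themselves are left out of the region.

```python
import re
from typing import Optional


def labeled_region(text: str, header_pattern: str,
                   footer_pattern: Optional[str] = None,
                   max_lines: Optional[int] = None) -> Optional[str]:
    """Return the lines after the first header match, or None if no header is found.

    Hypothetical helper: regex patterns stand in for the header and footer labels.
    """
    lines = text.splitlines()
    header = re.compile(header_pattern)
    footer = re.compile(footer_pattern) if footer_pattern else None

    # Locate the first line that contains the header label.
    start = next((i for i, line in enumerate(lines) if header.search(line)), None)
    if start is None:
        return None

    region = []
    for line in lines[start + 1:]:
        # Stop at the footer label, if one is configured.
        if footer is not None and footer.search(line):
            break
        # Stop once the line limit is reached, if one is configured.
        if max_lines is not None and len(region) >= max_lines:
            break
        region.append(line)
    return "\n".join(region)


sample = """Invoice 1001
Line Items
Widget  2  10.00
Gadget  1  25.00
Total  45.00
Thank you"""

# Region bounded by a footer label.
print(labeled_region(sample, r"^Line Items$", footer_pattern=r"^Total"))
# Region bounded by a line limit instead.
print(labeled_region(sample, r"^Line Items$", max_lines=1))
```

When both a footer and a line limit are supplied, this sketch stops at whichever comes first; whether the product behaves the same way is not stated here.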

Configuration:

  • Use the 'Header Extractor' property to define how the header label is identified.
  • Use the 'Footer Extractor' property to define how the footer label is identified (optional).
  • Set 'Maximum Lines' to limit the number of lines included after the header (optional).
  • Use the 'Preprocessing' property to apply text preprocessing to the extracted region.
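
The 'Preprocessing' property is described only in general terms here, so the following Python sketch shows one plausible reading of tab and paragraph marking. The function names, the marker characters, and the rules (two or more spaces treated as a column gap, a blank line treated as a paragraph break) are assumptions made for illustration; the product's actual behavior may differ.

```python
import re


def mark_tabs(region: str, marker: str = "\t") -> str:
    """Replace each run of two or more spaces within a line with a single marker.

    Hypothetical: makes column boundaries in tabular text explicit.
    """
    return "\n".join(
        re.sub(r" {2,}", marker, line.strip()) for line in region.splitlines()
    )


def mark_paragraphs(region: str, marker: str = "\u00b6") -> str:
    """Join wrapped lines and end every paragraph except the last with a marker.

    Hypothetical: a blank line is taken to separate paragraphs.
    """
    paragraphs = re.split(r"\n\s*\n", region.strip())
    joined = [" ".join(p.split()) for p in paragraphs]
    return f"{marker}\n".join(joined)


table = "Widget  2  10.00\nGadget  1  25.00"
print(mark_tabs(table))

prose = "First line\ncontinues here.\n\nSecond paragraph."
print(mark_paragraphs(prose))
```

Explicit column and paragraph markers of this kind give a language model a structural cue that plain whitespace often loses.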

Use Cases:

  • Extracting tables or sections that follow a specific label in semi-structured documents.
  • Supplying the AI with only the relevant region of a document, reducing prompt size and focusing the model's attention.
  • Improving extraction accuracy for documents in which the target region repeats or appears in varying positions.

Properties

Name    Type    Description
