Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

AI Extract Section Options

Data Section Data Element Extension Grooper.GPT

Defines options controlling how AI Extract works for a Data Section.

Remarks

The 'AI Extract Section Options' class provides advanced configuration for how Grooper's AI Extract method interacts with a specific Data Section. These options allow you to control the alignment strategy used by the LLM (Large Language Model) when extracting section instances, ensuring that each section is accurately located and mapped back to the document for review, navigation, and validation.

Role in Grooper

  • Used to customize the extraction and alignment behavior for individual Data Sections when using AI Extract.
  • Supports both single-instance and multi-instance sections, adapting to a wide range of document layouts and data grouping scenarios.
  • Enables integration of section-specific alignment instructions into the LLM prompt, improving extraction accuracy and reliability.

How It Works

  • Each Data Section can be extended with an 'AI Extract Section Options' object, allowing you to override default alignment behavior.
  • The 'Section Alignment' property determines what information is requested from the LLM (such as quotes, bounding boxes, or labels) and how Grooper interprets the LLM's output to identify section boundaries in the document.
  • These settings are especially important for multi-instance sections, where accurate splitting and alignment of repeated content is required.

Configuration and Usage

  • Choose an alignment mode that matches the structure of your documents and the nature of the section being extracted.
  • For simple, single-instance sections, alignment may not be necessary. For repeating or complex sections, use quoting or geometric modes to improve accuracy.
  • The selected mode impacts the JSON schema sent to the LLM, the expected response format, and the mechanism used to locate section instances in the document.

Example Scenario

For a document with multiple repeating claim sections, you might use 'StartQuote' to have the LLM return a quote from the first line of each section, allowing Grooper to split the document accordingly. For a section with a well-defined geometric region, use 'Geometric' to request explicit page and bounding box information.

These options provide granular control over AI-driven section extraction, supporting robust, accurate, and context-aware data capture in Grooper.

Properties

NameTypeDescription
Notification