Grooper Help - Version 25.0
25.0.0023 2,165
  • Overview
  • Help Status

Extracted

Quoting Method Grooper.GPT

Selects all or part of a document as a quote for the AI model, with optional preprocessing for the text content.

Remarks

The Extracted quoting method provides flexible control over what content from a document is supplied as a quote to the AI model in Grooper's AI-powered activities. It is designed for scenarios where you want to include only specific portions of a document, or to apply preprocessing to the content before it is sent to the AI.

Overview

Extracted enables you to:

  • Target specific fields, patterns, or regions within a document using a Value Extractor.
  • Quote the entire document if no extractor is specified.
  • Apply text preprocessing (such as tab or paragraph marking) to improve AI understanding of complex layouts or natural language.
  • Choose whether extraction is performed on the entire Document Instance or a specific field/section.

This method is suitable for both targeted and full-document quoting, supporting advanced prompt engineering and context control for AI models.

How It Works

  1. Extraction Scope:
    The extraction scope is determined by the 'Document Level' property. When enabled, extraction is performed on the entire Document Instance; otherwise, it operates on the current field or section context.

  2. Value Extraction:
    If a 'Quote Extractor' is specified, all values matched by the extractor are concatenated and used as the quote. If not set, the entire document (or scope) is quoted.

  3. Preprocessing:
    If preprocessing options are configured, they are applied to the extracted content. This can include marking tabs, paragraphs, or other structural elements to help the AI model interpret the text more accurately.

  4. Quote Assembly:
    The final quote is assembled and supplied to the AI model as part of the prompt or context.

Typical Use Cases

  • Supplying only the most relevant extracted values to the AI, reducing prompt size and focusing the model's attention.
  • Improving AI understanding of tables, lists, or natural language by marking structural elements.
  • Quoting the entire document when no extractor is specified, with or without preprocessing.

Configuration Guidance

  • Set the 'Quote Extractor' property to a Value Extractor to target specific content.
    Example: Use a regular expression extractor to select invoice numbers or dates.
  • Use the 'Preprocessing' property to enable tab or paragraph marking for documents with complex layouts.
  • Enable 'Document Level' to ensure extraction covers the whole document context, especially for document-wide prompts.

Best Practices

  • Use targeted extraction to minimize prompt size and improve AI relevance.
  • Apply preprocessing when working with tabular data or multi-line paragraphs to enhance extraction accuracy.
  • Test different configurations to find the optimal balance between context size and AI performance.

Properties

NameTypeDescription

See Also

Used By

Notification