Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Quoting Method

Embedded Object Grooper.GPT

Quoting methods provide various mechanisms for feeding quotes from a document into AI chat operations.

Remarks

Quoting methods define how content is selected from a document and formatted as input for large language model (LLM) completions. Their primary role is to extract, transform, and present relevant document content—referred to as a "quote"—to the AI in a way that maximizes context, accuracy, and efficiency.

Theory and Purpose

Modern LLMs are highly sensitive to the context and structure of their input. The quality and relevance of the information provided to the model directly impacts its ability to generate accurate, useful responses. Quoting methods serve as the bridge between raw document data and the AI, ensuring that only the most pertinent, well-structured, and cost-effective content is included in the prompt.

  • Targeted Context: By selecting only the necessary portions of a document (such as a specific field, table, or region), quoting methods help focus the model's attention and reduce noise.
  • Cost Efficiency: LLMs have input token limits and usage costs. Quoting methods help minimize prompt size by including only what is needed, avoiding unnecessary or redundant content.
  • Context Awareness: Some quoting methods apply preprocessing (such as tab or paragraph marking) or structure the data (such as JSON or layout schemas) to help the AI better interpret complex layouts, tables, or natural language.
  • Adaptability: Different tasks (extraction, summarization, classification, etc.) may require different quoting strategies. The quoting method can be selected or configured to match the needs of the operation.

How Quoting Methods Work

A quoting method typically:

  • Defines the scope of content to extract (entire document, specific region, extracted values, etc.).
  • Optionally applies preprocessing or formatting to the content.
  • Packages the result in a format suitable for the AI (plain text, JSON, layout schema, etc.).
  • Supplies the quote as part of the prompt to the LLM, often with a standard prefix or instructions.

Examples

  • Extracted: Selects all or part of the document, optionally applying preprocessing.
  • Labeled Region: Extracts content following a header label, useful for tables or sections.
  • Data Values: Supplies the full Data Model hierarchy as JSON.
  • Layout Objects: Provides a structured map of the document's visual layout.

Derived Types

There are 5 implementations of Quoting Method.

Data Values Supplies the AI with extracted document data in JSON format.
Extracted Selects all or part of the document, optionally applying preprocessing options.
Labeled Region Selects content occurring after a header label.
Layout Objects Supplies the LLM with the location and content of all text segments, lines, barcodes, checkboxes, and etc.
Semantic Selects portions of the document which are semantically similar to a set of examples.

Used By

Notification