Grooper Help - Version 25.0
25.0.0017 2,127

Cluster Parameters

Embedded Object Grooper.GPT

Configures how semantically similar chunks are grouped into a unified quote region.

Remarks

The Cluster Parameters class defines the logic for combining high-scoring chunks of text into a single, contextually relevant quote when performing semantic search in Grooper. Clustering helps ensure that extracted content is not limited to isolated fragments, but instead forms a coherent region that accurately represents the intended clause or passage.

Clustering is especially important when semantic matches are distributed across adjacent or overlapping segments. By tuning the parameters in this class, you can control the strictness, size, and continuity of the resulting quote region.

How Clustering Works

  • After scoring all document chunks for semantic similarity, the top results are evaluated for inclusion in the cluster.
  • Absolute and relative thresholds are applied to filter out low-quality or marginally related matches.
  • Additional criteria, such as overlap requirements, help ensure that only contiguous or closely related chunks are grouped.
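The steps above can be illustrated with a minimal Python sketch. It is not Grooper's implementation: the names Chunk, cluster_quote, min_score, relative_ratio, and max_gap are hypothetical stand-ins for an absolute threshold, a relative threshold, and a continuity requirement, and the default values are arbitrary. The sketch assumes each chunk carries character offsets and a similarity score, and it grows the quote region outward from the best-scoring chunk.

```python
from dataclasses import dataclass


@dataclass
class Chunk:
    start: int    # character offset where the chunk begins
    end: int      # character offset where the chunk ends (exclusive)
    score: float  # semantic similarity to the query; higher is better


def cluster_quote(chunks, min_score=0.5, relative_ratio=0.8, max_gap=0):
    """Return (start, end) of the quote region, or None if nothing qualifies.

    All parameters are hypothetical illustrations, not Grooper property names:
      min_score      - absolute threshold every included chunk must meet
      relative_ratio - included chunks must score at least this fraction
                       of the best chunk's score
      max_gap        - largest gap (in characters) allowed between a chunk
                       and the region it joins; 0 means touch or overlap
    """
    if not chunks:
        return None

    best = max(chunks, key=lambda c: c.score)
    if best.score < min_score:
        return None  # even the best match fails the absolute threshold

    # A chunk must pass both the absolute and the relative threshold.
    cutoff = max(min_score, best.score * relative_ratio)
    candidates = sorted(
        (c for c in chunks if c.score >= cutoff), key=lambda c: c.start
    )

    # Grow the region from the best chunk, absorbing qualifying chunks
    # that touch or overlap it, until no more chunks can be added.
    start, end = best.start, best.end
    changed = True
    while changed:
        changed = False
        for c in candidates:
            touches = c.start <= end + max_gap and c.end >= start - max_gap
            extends = c.start < start or c.end > end
            if touches and extends:
                start, end = min(start, c.start), max(end, c.end)
                changed = True
    return (start, end)


sample = [
    Chunk(0, 100, 0.30),    # low score: filtered out
    Chunk(80, 180, 0.82),
    Chunk(160, 260, 0.90),  # best match
    Chunk(240, 340, 0.75),
    Chunk(600, 700, 0.88),  # high score but not adjacent: excluded
]
print(cluster_quote(sample))  # (80, 340)
```

In this sketch, raising relative_ratio to 0.9 drops the 0.75 chunk and shrinks the region to (80, 260), while the isolated chunk at offset 600 is never merged because it does not touch the region, however well it scores.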

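Configuring these parameters is a trade-off between precision and recall. Stricter thresholds and overlap requirements produce a tighter quote with less unrelated text but risk cutting off part of the passage; looser settings capture more of the passage but may pull in marginally related text.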

Properties

Name    Type    Description

Used By

Notification