Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Transaction Layout Detection

Grooper Object Grooper.Extract

Detects transaction boundaries in documents by analyzing layout features such as lines, boxes, and whitespace areas.

Remarks

The Transaction Layout Detection class enables Grooper to identify the start and end of transactions or sections within a document by considering visual presentation elements. This includes horizontal and vertical lines, boxes, and whitespace areas that often separate or group related content.

How It Works

When extracting transactions, this class uses the configured 'Minimum Width' and 'Maximum Distance' properties to determine which lines of text should be grouped together as part of the same transaction. By leveraging layout cues, it can more accurately segment complex documents where text alone is insufficient for reliable separation.

Configuration Guidance

  • Set 'Minimum Width' to exclude narrow sections that are unlikely to represent valid transactions.
  • Adjust 'Maximum Distance' to control how far apart lines of text can be and still be considered part of the same transaction.
  • Both properties accept values in logical units (such as inches or percentages), allowing for flexible configuration across different document types and layouts.

Usage Example

To configure Transaction Layout Detection for a document with wide transaction rows and moderate spacing:

layoutDetection.MinimumWidth = "75%"; layoutDetection.MaximumDistance = "0.5in";

This configuration will only return sections that are at least 75% of the page width and will group lines that are within half an inch of each other.

Notes

Properties

NameTypeDescription

Used By

Notification