Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Multiline Row Settings

Embedded Object Grooper.Extract

Enables or configures capture of multi-line table row content in tabular extraction.

Remarks

The Multiline Row Settings class controls how Tabular Layout detects and assembles table rows that span multiple text lines. In many real-world documents, a single logical table row may be split across several lines due to text wrapping, stacked header layouts, or the inclusion of free-form comments and notes.

By configuring these settings, you can ensure that all relevant content is captured for each row, even when it is not confined to a single line.

How Multi-Line Row Detection Works

When enabled, multi-line row detection extends each detected table row to include additional lines that are determined to be part of the same logical row. This is especially useful for:

  • Text Wrapping: Cell values that wrap onto subsequent lines.
  • Stacked Layouts: Vertically stacked header cells with corresponding stacked values in each row.
  • Free-Form Content: Comments, notes, or other information that appears below the main row data.

The detection process uses configurable limits for the number of lines, vertical spacing, and leading lines, and can be further tuned for page-wrapped or stacked layouts.

Configuration Guidance

  • Use 'Maximum Lines Per Row' to limit how many lines can be grouped into a single row.
  • Set 'Maximum Leading Lines' to allow for lines that precede the main row content, such as descriptions or comments.
  • Adjust 'Maximum Line Spacing' to control how much vertical space is allowed between lines in a row.
  • Enable 'Detect Page Wrap' to support rows that continue across page breaks.
  • Enable 'Detect Stacked Layout' for tables with vertically stacked header and value cells.

Example Scenarios

  • Invoice Line Items: Capture item descriptions that wrap onto multiple lines below the main row.
  • Stacked Headers: Extract values from rows where each value is stacked vertically under its header.
  • Free-Form Notes: Include comments or notes that appear below each row as part of the row's data.

Best Practices

  • Test with a variety of sample documents to ensure multi-line detection is robust and does not over- or under-capture content.
  • Use diagnostic output to review which lines are grouped into each row.
  • Adjust limits and enable/disable features as needed to match your document layout.

For more information, see the documentation for Tabular Layout, Table Row Detector, and Data Table.

Properties

NameTypeDescription

Used By

Notification