Grooper Help - Version 25.0
25.0.0029 2,194

Table Instance - Capture Rows

Table Instance Command GrooperReview.Views.DataGrid.Commands

Capture table rows using a regular expression in a Data Grid Table.

Remarks

The Capture Rows command enables ad hoc extraction of tabular data from documents by identifying and capturing table rows using a configurable regular expression pattern. This command is designed for scenarios where table rows are not easily detected by layout or delimiter-based methods, such as when rows are separated by variable whitespace, embedded in unstructured text, or require context-sensitive matching.

How It Works

  • The command scans the source text for matches to the configured 'Pattern', optionally constrained by a 'Prefix' and/or 'Suffix'.
  • Each match is treated as a candidate table row. If 'Lines Per Row' is set, rows are grouped by a fixed number of lines.
  • Extracted rows are added to the table, and cell values may be mapped from named groups in the regular expression.
  • The command supports both fixed and variable row lengths, enabling flexible extraction from diverse document formats.
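
The matching behavior described above can be approximated outside Grooper with a short script. The following is a minimal sketch in Python, not Grooper's implementation: the function name find_rows and the sample text are hypothetical, and 'Prefix' and 'Suffix' are modeled as required context that is matched but excluded from the captured row, which is an assumption about how the properties combine.

```python
import re

def find_rows(text, pattern, prefix="", suffix=""):
    """Return the text of each candidate row matched by `pattern`.

    `prefix` and `suffix` are treated as required surrounding context
    that is matched but not returned (an assumption for illustration).
    """
    combined = f"(?:{prefix})(?P<row>{pattern})(?:{suffix})"
    return [m.group("row") for m in re.finditer(combined, text, re.MULTILINE)]

# Hypothetical source text: only the "Item:" lines are table rows.
sample = (
    "Invoice 1001\n"
    "Item: WIDGET-01   3   4.50 EA\n"
    "Item: GADGET-77  12  10.00 EA\n"
    "Total: 133.50\n"
)

rows = find_rows(
    sample,
    pattern=r"[A-Z]+-\d+\s+\d+\s+\d+\.\d{2}",
    prefix=r"^Item:\s+",
    suffix=r"\s+EA$",
)
print(rows)
```

In this sketch the header and total lines are skipped because they lack the required context, so only the two item lines come back as candidate rows.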

Configuration & Usage

  • Use the 'Pattern' property to define the regular expression that matches the desired row content.
  • Optionally set 'Prefix' and/or 'Suffix' to require context before or after each row.
  • If the table has a fixed number of lines per row, set 'Lines Per Row' to the appropriate value.
  • The command is ideal for extracting line items, transaction logs, or any repeating data that cannot be captured by standard table extract methods.
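
To illustrate what a fixed 'Lines Per Row' value means, the sketch below groups consecutive lines of text into rows of a fixed size. It is a simplified Python illustration under stated assumptions: the helper group_lines and the sample statement text are hypothetical, blank lines are skipped, and Grooper's actual grouping logic may differ.

```python
def group_lines(text, lines_per_row):
    """Group non-blank lines into rows of a fixed size."""
    lines = [ln for ln in text.splitlines() if ln.strip()]
    return [
        lines[i:i + lines_per_row]
        for i in range(0, len(lines), lines_per_row)
    ]

# Hypothetical statement where every transaction spans exactly two lines.
sample = (
    "2024-01-03  ACH DEPOSIT\n"
    "    Ref 88123   1,250.00\n"
    "2024-01-04  CARD PURCHASE\n"
    "    Ref 88124     -42.17\n"
)

rows = group_lines(sample, lines_per_row=2)
for row in rows:
    print(row)
```

If the real data sometimes spans one line and sometimes three, fixed grouping like this would misalign every row after the first irregular one, which is why a fixed value only suits strictly regular tables.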

Extraction Workflow

  1. The command locates all matches for the configured pattern in the source text.
  2. For each match, a new row is created in the table.
  3. If named groups are defined in the pattern, their values are mapped to table cells.
  4. Rows are sorted, rebuilt, and validated to ensure data integrity.
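
The four steps above can be sketched in Python as follows. This is an illustration only: the column names Date, Description, and Amount, the sample text, and the validation rules are hypothetical, and sorting by position in the source text is an assumption. Note that Python writes named groups as (?P<name>...), whereas .NET-style regular expressions use (?<name>...).

```python
import re
from datetime import datetime

# Named groups stand in for the table's columns (hypothetical names).
ROW = re.compile(
    r"^(?P<Date>\d{4}-\d{2}-\d{2})\s+(?P<Description>.+?)\s+(?P<Amount>-?\d+\.\d{2})$",
    re.MULTILINE,
)

def is_valid(row):
    """Assumed validation: the date and amount must both parse."""
    try:
        datetime.strptime(row["Date"], "%Y-%m-%d")
        float(row["Amount"])
        return True
    except ValueError:
        return False

def extract_table(text):
    matches = list(ROW.finditer(text))            # 1. locate all matches
    matches.sort(key=lambda m: m.start())         # 4a. sort by source position
    table = []
    for m in matches:
        row = m.groupdict()                       # 2-3. one row per match, groups -> cells
        row["Valid"] = is_valid(row)              # 4b. validate each row
        table.append(row)
    return table

sample = (
    "2024-01-04  CARD PURCHASE   -42.17\n"
    "Statement period: January\n"
    "2024-01-03  ACH DEPOSIT    1250.00\n"
    "2024-13-40  BAD DATE          1.00\n"
)

table = extract_table(sample)
for row in table:
    print(row)
```

The last sample line matches the pattern but fails validation because month 13 is not a real date, which shows why a validation pass after matching is useful.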

Diagnostic Artifacts

  • Extraction results are available as Data Instance objects for each captured row.
  • Diagnostic logs may include details about pattern matches, row grouping, and validation outcomes.

Best Practices

  • Test your regular expression thoroughly to ensure it matches only the intended rows.
  • Use named groups in the pattern to directly capture cell values.
  • Adjust 'Prefix' and 'Suffix' to avoid false positives or overlapping matches.
  • Set 'Lines Per Row' only if the table structure is strictly fixed; otherwise, leave it unset for variable-length rows.
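
One way to follow the first practice is to check a candidate pattern against lines that should and should not match before configuring it. The sketch below uses Python's re module as a stand-in for testing; the pattern, group names, and sample lines are hypothetical, and a pattern validated this way should still be re-tested in Grooper, since regular expression engines differ in syntax details.

```python
import re

# Hypothetical line-item pattern: quantity, SKU, price.
PATTERN = re.compile(
    r"^(?P<Qty>\d+)\s+(?P<Sku>[A-Z]{3}-\d{4})\s+(?P<Price>\d+\.\d{2})$"
)

should_match = [
    "2 ABC-1234 19.99",
    "10  XYZ-0001   5.00",
]
should_not_match = [
    "Qty SKU Price",                    # header line
    "Subtotal 24.99",                   # summary line
    "2 ABC-1234 19.99 (backordered)",   # trailing text breaks the row format
]

false_negatives = [s for s in should_match if not PATTERN.fullmatch(s)]
false_positives = [s for s in should_not_match if PATTERN.fullmatch(s)]

print("missed rows:", false_negatives)
print("unintended rows:", false_positives)
```

Both lists being empty suggests the pattern matches the intended rows and nothing else in the sample; any entry in either list points to a pattern that is too strict or too loose.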

For more information, see the documentation for Data Table, Data Column, and Pattern Match extract methods.

Properties

Name    Type    Description

See Also

Notification