Grooper Help - Version 25.0
25.0.0017 2,127

Extractor Node - Tester

Extractor Node Design Tab (GrooperReview.Pages.Design)

Test an Extractor Node on documents in a test batch.

Remarks

The Extractor Node Tester tab is an interactive workspace for designing, configuring, and testing Extractor Nodes in Grooper, such as Data Types, Field Classes, and Value Readers. It is intended for solution designers, administrators, and subject matter experts who need to validate extraction logic, review results, and troubleshoot extractor configuration against real documents with immediate feedback.

Purpose

Use this tab to build and refine extractor logic (such as regular expressions, collation patterns, or value extractors), run extraction tests against sample documents, and review results in a structured, visual format. Because editing, testing, and review share one workspace, you can iterate on an extractor and diagnose problems quickly.
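
As a rough illustration of what an extraction test does, the sketch below runs a regular expression over sample text and lists each hit with its value and character offsets, much as the result list shows hits for the selected document. It is not Grooper code: Python's `re` module stands in for the extractor's own engine, and the pattern, the sample text, and the `run_extraction_test` helper are hypothetical.

```python
import re
from typing import NamedTuple


class Hit(NamedTuple):
    value: str  # matched text
    start: int  # offset of the first matched character
    end: int    # offset one past the last matched character


def run_extraction_test(pattern: str, text: str) -> list[Hit]:
    """Return every non-overlapping match of pattern in text, in document order."""
    return [Hit(m.group(0), m.start(), m.end()) for m in re.finditer(pattern, text)]


# Hypothetical sample document text and pattern, for illustration only.
sample_text = "Invoice Date: 03/14/2024  Due Date: 04/13/2024"
date_pattern = r"\b\d{2}/\d{2}/\d{4}\b"

hits = run_extraction_test(date_pattern, sample_text)
for hit in hits:
    print(f"{hit.value} at [{hit.start}, {hit.end})")
```

Keeping the offsets alongside each value is what makes it possible to highlight a selected result in the source text.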

UI Components

  • Toolbar:
    • Action buttons for testing, toggling auto-test, viewing diagnostics, launching the Data Inspector, visualizing regular expressions, training, and more.
    • AI build button for natural language regex authoring (if enabled).
  • Tab List:
    • Switch between "Expressions" and "Properties" views for extractor configuration.
  • Property Grid:
    • Edit extractor properties with contextual help and validation.
  • Test Source Panel:
    • Select the document or data instance to use as input for extraction tests.
  • Document Viewer:
    • Preview the selected document and highlight extraction results.
  • Result List:
    • Review all extraction results, with support for selection, sorting, and filtering.
  • Page Navigator:
    • Navigate through multiple pages of extraction results.
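
The result list's filtering, sorting, and paging can be pictured with the following sketch. It is an illustration under stated assumptions rather than Grooper's implementation: the `Result` fields, the confidence threshold, and the page size are all made up for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Result:
    value: str
    page: int          # document page the hit was found on (hypothetical field)
    confidence: float  # hypothetical score between 0 and 1


def view_results(results, min_confidence=0.0, page_size=2, page_index=0):
    """Filter by confidence, sort best-first, and return one page of results."""
    kept = [r for r in results if r.confidence >= min_confidence]
    kept.sort(key=lambda r: r.confidence, reverse=True)
    start = page_index * page_size
    return kept[start:start + page_size]


# Hypothetical results from a single test run.
results = [
    Result("03/14/2024", page=1, confidence=0.92),
    Result("04/13/2024", page=1, confidence=0.71),
    Result("12/31/1999", page=2, confidence=0.35),
    Result("01/01/2025", page=3, confidence=0.88),
]

first_page = view_results(results, min_confidence=0.5)
second_page = view_results(results, min_confidence=0.5, page_index=1)
```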

Interactive Features

  • Automatic and Manual Testing:
    • Enable or disable auto-test mode. When enabled, tests run automatically after each change; otherwise, use the test button.
  • Result Selection and Highlighting:
    • Selecting a result in the list highlights the corresponding region in the document viewer.
  • Diagnostics and Inspection:
    • View detailed diagnostics for the last test, or launch the Data Inspector for in-depth review.
  • Training and Visualization:
    • Mark results as positive/negative for training, or visualize regular expressions and extraction logic.
  • Context Zone Definition:
    • Use the rubberband tool to define context zones by drawing rectangles on the document image.
  • AI Regex Generation:
    • Use the build button to generate or modify regular expressions from natural language instructions (if AI tools are enabled).
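
The difference between automatic and manual testing can be sketched as a small state holder: with auto-test on, every edit triggers a run; with it off, nothing runs until the test action is invoked. The `TesterSession` class and its method names are hypothetical, and Python's `re` module stands in for the extractor.

```python
import re


class TesterSession:
    """Toy model of the auto-test toggle; not Grooper's implementation."""

    def __init__(self, text: str, auto_test: bool = False):
        self.text = text
        self.auto_test = auto_test
        self.pattern = ""
        self.results: list[str] = []
        self.run_count = 0

    def set_pattern(self, pattern: str) -> None:
        # An edit re-runs the test only when auto-test is enabled.
        self.pattern = pattern
        if self.auto_test:
            self.test()

    def test(self) -> None:
        # Manual entry point, comparable to pressing the test button.
        self.results = re.findall(self.pattern, self.text)
        self.run_count += 1


manual = TesterSession("PO-1001, PO-1002", auto_test=False)
manual.set_pattern(r"PO-\d+")  # no run yet, so results stay empty
manual.test()                  # explicit run

auto = TesterSession("PO-1001, PO-1002", auto_test=True)
auto.set_pattern(r"PO-\d+")    # runs immediately after the edit
```

Manual mode is the safer choice when a single run is expensive, since intermediate edits do not each trigger a test.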

Workflow and Scope

  • The tab is available for any Extractor Node in the design tree.
  • Edit extractor properties and expressions using the property grid and tab list.
  • Select a test source document to validate extraction patterns in real time.
  • Enable auto-test for rapid iteration, or use the test button for manual control.
  • Review results in the result list and use the pager for large result sets.
  • Use the document viewer to visually confirm extraction accuracy and context.
  • Access diagnostics, training, and visualization tools for advanced troubleshooting and optimization.
  • Use the AI build button to generate or refine regular expressions from natural language instructions (if available).

Example

The following diagram illustrates the layout of the Extractor Node Tester tab:

┌────────────────────────────────────────────────────────────────────────────────────────────┐
│ Extractor Node Tester                                {build} {test} {toggle} {diagnostics} │
├────────────────────────────────────────────────────────────────────────────────────────────┤
│ ┌───────────── Tab List ─────────────┐ ┌──────────────── Document Viewer ────────────────┐ │
│ │ {Expressions} {Properties}         │ │                                                 │ │
│ │------------------------------------│ │                                                 │ │
│ │  Expression Grid or Property Grid  │ │                                                 │ │
│ │                                    │ └─────────────────────────────────────────────────┘ │
│ └────────────────────────────────────┘ ┌────────────────── Result List ──────────────────┐ │
│ ┌───────────── Test Source ──────────┐ │ {Pager} {inspect} {visualize} {weightings}      │ │
│ │                                    │ │ {train_positive} {train_negative} {rubberband}  │ │
│ └────────────────────────────────────┘ └─────────────────────────────────────────────────┘ │
└────────────────────────────────────────────────────────────────────────────────────────────┘
  • {build}: AI regex authoring (if enabled)
  • {test}: Run extraction test
  • {toggle}: Enable/disable auto-test
  • {diagnostics}: View diagnostics for last test
  • {Expressions}/{Properties}: Tab list for extractor configuration
  • Expression Grid: Edit regular expressions
  • Property Grid: Edit extractor properties
  • Test Source: Select test document or data instance
  • Document Viewer: Preview and highlight extraction results
  • {Pager}: Navigate result pages using a Page Navigator
  • {inspect}: Launch Data Inspector
  • {visualize}: Visualize regex/extraction logic
  • {weightings}: View scoring details (Field Class only)
  • {train_positive}/{train_negative}: Train result as positive/negative (Field Class only)
  • {rubberband}: Define context zone (Field Class only)
  • Result List: Review and select extraction results

Accessibility

  • Fully navigable by keyboard and mouse.
  • Visual cues and highlighting indicate the selected extractor, test status, and errors.
  • Tooltips and context menus provide guidance for available actions.

Advanced Features

  • Immediate Feedback: Extraction results and diagnostics refresh as soon as each test run completes.
  • Diagnostics and Data Inspector: Access error messages, warnings, and performance metrics for troubleshooting extraction logic.
  • Auto-Run Support: Optionally run extraction tests automatically when a new selection is made.
  • Training and Visualization: Mark results for training, visualize regular expressions, and define context zones interactively.
  • AI-Assisted Authoring: Use natural language instructions to generate or refine regular expressions (if enabled).
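
To make the diagnostics idea concrete, the sketch below wraps a single test run so that it reports an error message, the hits found, and the elapsed time instead of failing outright. The `run_with_diagnostics` helper and the shape of its return value are assumptions for illustration; Python's `re` module stands in for the extractor, and Grooper's actual diagnostics output is not shown here.

```python
import re
import time


def run_with_diagnostics(pattern: str, text: str) -> dict:
    """Run one test and collect the hits, any error message, and elapsed time."""
    started = time.perf_counter()
    try:
        hits = [m.group(0) for m in re.finditer(pattern, text)]
        error = None
    except re.error as exc:
        # An invalid pattern is reported as a diagnostic instead of raising.
        hits, error = [], f"invalid pattern: {exc}"
    elapsed_ms = (time.perf_counter() - started) * 1000
    return {"hits": hits, "error": error, "elapsed_ms": elapsed_ms}


ok = run_with_diagnostics(r"\d{4}", "Order 2024 shipped 2025")
bad = run_with_diagnostics(r"(\d{4}", "Order 2024 shipped 2025")  # unbalanced parenthesis

print(ok["hits"], ok["error"])
print(bad["hits"], bad["error"])
```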

Child Controls

Name | Type | Summary

Used By

Notification