Overview
Help Status
Activity
- Attended Activity
  - Review
- Code Activity
  - Apply Rules
  - Attach
  - Batch Transfer
  - Burst Book
  - Classify
  - Clip Frames
  - Convert Data
  - Correct
  - Deduplicate
  - Detect Frames
  - Detect Language
  - Detect Language (Legacy)
  - Dispose Batch
  - Execute
  - Export
  - Extract
  - Fill Data
  - GPT Embed
  - Image Processing
  - Initialize Card
  - Launch Process
  - Mark Attachments
  - Merge
  - Recognize
  - Redact
  - Remove Level
  - Render
  - Route
  - Send Mail
  - Separate
  - Spawn Batch
  - Split Pages
  - Split Text
  - Text Transform
  - Train Lexicon
  - Translate
  - XML Transform
Article
- AI Assistants
- AI Powered Features
- Batch Processing Workflows
- Document Scanning
- PDF Processing
- Search Overview
Attachment Type
- EDI File
- HTML Document Base
  - HTML Document
  - Mail Message
- JPEG Image
- JSON File
- Office Document
  - Excel Document
  - Power Point Document
  - Word Document
- PDF Document
- PST File
- Text Document
- TIFF Document
- vCard
- XML File
- ZIP Archive
Behavior
- Export Behavior
- Import Behavior
- Indexing Behavior
- JSON Data Mapping
- Labeling Behavior
- PDF Data Mapping
- Separation Behavior
- Text Rendering
Capture Device
- ColorTrac Scanner
- Import Device
- ISIS Device
- TWAIN Device
Classify Method
- ESP Classify Method
  - Lexical
  - Rules-Based
  - Search Classifier
- Labelset-Based
- LLM Classifier
- Visual
CMIS Binding
- CMIS
- Custom Binding
  - AppXtender
  - Base FTP Binding
    - FTP
    - SFTP
  - Box
  - Exchange
  - FileBound
  - IBM FileNet Connector
  - IMAP
  - NTFS
  - OneDrive
  - SharePoint
CMISQL Element
- CMISQL Query
- Join Clause
- ORDER BY Element
- Select Element
- Where Predicate
  - AT_LEVEL Predicate
  - Comparison Predicate
  - CONTAINS Predicate
  - IN Predicate
  - MATCHES Predicate
  - Predicate List
  - Scope Predicate
Collation Provider
- Base Combining Provider
  - AND
  - Base Array Provider
    - Array
    - Ordered Array
      - Key-Value List
      - Key-Value Pair
  - Combine
- Individual
- Multi-Column
- Pattern-Based
- Split
Command
- Action List - Create Copy Actions
- AI Chat
  - AI Chat - Delete
  - AI Chat - Rename
- Attachment Type
  - Attachment Type - Change Extension
  - Attachment Type - Remove Attachment
  - Attachment Type - Remove PDF Version
  - Attachment Type - Rename Attachment
- Batch
  - Batch - Archive
  - Batch - Change Priority
  - Batch - Combine
  - Batch - Pause
  - Batch - Remove Job History
  - Batch - Reset
  - Batch - Resume
  - Batch - Send To Production
  - Batch - Send To Test
  - Batch - Update Process
- Batch Folder
  - Batch Folder - Add To Index
  - Batch Folder - Assign Document Type
  - Batch Folder - Classify Command
    - Batch Folder - Classify
    - Batch Folder - Train As
    - Batch Folder - Train From
  - Batch Folder - Collapse
  - Batch Folder - Edit Type Assignment
  - Batch Folder - Extract
  - Batch Folder - Group Children
  - Batch Folder - Insert Control Sheets
  - Batch Folder - New Text Document
  - Batch Folder - Remove From Index
  - Batch Folder - Remove Level
  - Batch Folder - Revert To Loose Pages
  - Batch Folder - Set Field Value
  - Batch Folder - Sort Children
- Batch Object
  - Batch Object - Append To Previous
  - Batch Object - Clear Flag
  - Batch Object - Create New Folder
    - Batch Object - Add Folder
    - Batch Object - Insert Folder
  - Batch Object - Flag Item
  - Batch Object - Merge Selected
  - Batch Object - Prepend to Next
  - Batch Object - Rename
  - Batch Object - Run Step
  - Batch Object - Send To Test Batch
    - Batch Object - Copy To Test Batch
    - Batch Object - Move To Test Batch
  - Batch Object - Split Folder
- Batch Page
  - Batch Page - Generate Thumbnail
  - Batch Page - Image Command
    - Batch Page - Display As Binary
    - Batch Page - Display As Color
    - Batch Page - Display As Grayscale
    - Batch Page - Image Editing Command
      - Batch Page - Invert
    - Batch Page - Reset
    - Batch Page - Rotate Left
    - Batch Page - Rotate Right
    - Batch Page - Undo Image Cleanup
  - Batch Page - Image Review Command
    - Batch Page - Apply Image Cleanup
  - Batch Page - Rasterize
- Batch Process
  - Batch Process - Create Project
  - Batch Process - Publish
  - Batch Process - Unpublish
- CMIS Connection
  - CMIS Connection - Import Repository
  - CMIS Connection - Reset
- CMIS Document Link
  - CMIS Document Link - Delete CMIS Document
  - CMIS Document Link - Load
  - CMIS Document Link - Move CMIS Document
  - CMIS Document Link - Save Version
  - CMIS Document Link - Update
- CMIS Export Map - Auto Map
- CMIS Folder Link
  - CMIS Folder Link - Delete
  - CMIS Folder Link - Load Children
  - CMIS Folder Link - Load Pages
  - CMIS Folder Link - Load Properties
  - CMIS Folder Link - Save Properties
- CMIS Import Map - Auto Map
- CMIS Repository - Reset
- CMIS Type Definition - Generate Local Type
- Column Map - Auto Map
- Content Link - Remove Link
- Content Type
  - Content Type - Clean Overrides
  - Content Type - Create Data Model
  - Content Type - Create Local Resources Folder
  - Content Type - Create Search Index
  - Content Type - Delete Search Index
  - Content Type - Generate Control Sheets
  - Content Type - Purge Training
  - Content Type - Rebuild Training
  - Content Type - Submit Indexing Job
- Convert Data - Create Convert Actions
- Copy Base
  - Copy Base - Auto Map
  - Copy Base - Create Child Actions
- Data Connection
  - Data Connection - Connection Command
    - Data Connection - Create Database
    - Data Connection - Create Table
    - Data Connection - Drop Table
  - Data Connection - Test Connection
- Data Element - Remove Overrides
- Data Field Container
  - Data Field Container - Build Fine Tuning File
  - Data Field Container - Container Command
    - Data Field Container - Generate Descriptions
    - Data Field Container - Generate Schema
    - Data Field Container - Import Descriptions
    - Data Field Container - Import Schema
- Data Type - Convert To Value Reader
- EDI File
  - EDI File - Bundle
  - EDI File - Load Data
  - EDI File - Split Envelopes
- Excel Document - Convert to CSV
- Exchange - Rebuild Search Folder
- Field Class - Purge Training
- File Store
  - File Store - Move Objects Here
  - File Store - Test Connection
- File System Link
  - File System Link - Change File Attributes
  - File System Link - Copy File
  - File System Link - Delete File
  - File System Link - Load Content
  - File System Link - Move File
  - File System Link - Save Content
- Folder - Remove Empty Subfolders
- FTP Link
  - FTP Link - Delete File
  - FTP Link - Load Content
  - FTP Link - Save Content
- HTML Document
  - HTML Document - Condition HTML
  - HTML Document - Convert to PDF
  - HTML Document - Convert To Text
- HTTP Link
  - HTTP Link - Load Content
  - HTTP Link - Rename Attachment
- JSON File
  - JSON File - Load Data
  - JSON File - Split
- Lexicon
  - Lexicon - Intersect
  - Lexicon - Merge Training
  - Lexicon - Normalize
  - Lexicon - Subtract
  - Lexicon - Truncate
- Machine - Tune File System
- Mail Link
  - Mail Link - Delete Message
  - Mail Link - Expand Attachments
  - Mail Link - Load Content
- Mail Message
  - Mail Message - Convert To RFC822
  - Mail Message - Expand Attachments
- Node
  - Node - Add Multiple Items
  - Node - Clear Children
  - Node - Clone
  - Node - Delete
  - Node - Move Down
  - Node - Move Up
  - Node - Publish To Repository
  - Node - Rename
- OAuth Client Credentials - Test
- Object Library
  - Object Library - Create Backup
  - Object Library - Rename Script
- PDF Document
  - PDF Document - Burst
  - PDF Document - Compact
  - PDF Document - Repair
- Project - Remove Empty Subfolders
- PST File - Burst
- PST Link - Load Content
- Resource File
  - Resource File - Delete Fine Tuned Model
  - Resource File - Rename
  - Resource File - Start Fine Tuning Job
- Root
  - Root - Database Cleanup
  - Root - Rebuild Indexes
  - Root - Run Import
- Saved Query
  - Saved Query - Delete
  - Saved Query - Rename
- Search Index - Generate Subsets
- SFTP Link
  - SFTP Link - Delete File
  - SFTP Link - Load Content
  - SFTP Link - Save Content
- Text Document
  - Text Document - Insert Page Breaks
  - Text Document - Normalize
  - Text Document - Split
- Value Reader - Convert To Data Type
- vCard - Expand Photo
- Word Document - Convert to PDF
- XML File
  - XML File - Condition XML
  - XML File - Format
  - XML File - Load Data
  - XML File - Split
  - XML File - Validate Schema
- ZIP Archive
  - ZIP Archive - Unpackage
  - ZIP Archive - Unzip
  - ZIP Archive - Update
- ZIP Link - Load Content
Content Link
- Document Link
  - CMIS Document Link
  - File System Link
  - FTP Link
  - HTTP Link
  - Mail Link
  - PST Link
  - SFTP Link
  - Subfile Link
  - ZIP Link
- Folder Link
  - CMIS Folder Link
Data Action
- Action List
- Calculate Value
- Clear Item
- Concat
- Copy Base
  - Append
  - Copy
- Data Lookup
- Execute Rule
- Extract From
- Fill
- Parse Value
- Raise Issue
- Remove
- Require Value
Data Instance
- Checkbox Instance
- Data Element Instance
  - Field Container Instance
    - Element Container Instance
      - Document Instance
      - Section Instance
      - Section Instance Collection
    - Table Instance
    - Table Row Instance
  - Field Instance
    - Table Cell Instance
- Labeled Instance
- Table Header Instance
Export Definition
- CMIS Export
- Data Export
- File Export
  - File Export
  - FTP Export
  - SFTP Export
- Mail Export
Export Format
- Attached File
- Merge Format
  - PDF Format
  - TIF Format
  - XML Format
  - ZIP Format
- Metadata Format
  - JSON Metadata
  - KVP Metadata
    - Delimited Metadata
    - Simple Metadata
  - XML Metadata
- Text Format
Grooper Command Console (GCC)
- connections
- databases
- help
- license
- scripts
- services
- utils
Import Definition
- CMIS Import
Import Provider
- Cmis Import Base
  - Import Descendants
  - Import Query Results
- File Import
  - File System Import
  - FTP Import
  - SFTP Import
- HTTP Import
- Mail Import
- OPEX Import
- Search Import
- Test Batch
IP Command
- Adjust Saturation
- Adjust Tint
- Analyze Photo
- Auto Adjust Levels
- Auto Color Balance
- Auto Convert
- Auto Deskew
- Auto Orient
- Auto QA
- Auto White Balance
- Barcode Detection
- Binarize
  - Threshold
- Blank Page Detection
- Border Detect
  - Auto Border Crop
  - Auto Border Invert
- Box Detection
  - Box Removal
- Brightness Contrast
- Classify Image
- Color Detection
- Color Dropout
- Color Stamp Detection
- Colorize
- Compute Moments
- Contrast Stretch
- Convert
- Corner Detection
- Crop
- Dilate Erode
- Edge Detection
- Execute Profile
- Extract Channel
- Extract Features
- Extract Page
- Feature Dropout
  - Binary Dropout
    - Barcode Removal
    - Blob Removal
    - Border Fill
    - Halftone Removal
    - Hole Punch Removal
    - Speck Removal
  - Scratch Removal
  - Shape Removal
- Filter
- Gamma Adjust
- Histogram
- Hough Lines
- Invert
- Line Detection
  - Line Removal
- Measure Entropy
- Mirror
- Negative Region Removal
- OCR Cleanup
- Patch Code Detection
- Posterize
- Projection Profile
- Randomize Defects
- Resize
- Rotate
- Shade Removal
- Shape Detection
- Solarize
- Sticky Note Detection
- Swap Channels
- Undistort
- Warp
Lookup Specification
- CMIS Lookup
- Database Lookup
- Lexicon Lookup
- Web Service Lookup
- XML Lookup
Measurement
- Logical Measurement
  - Logical Border
  - Logical Point
  - Logical Range
  - Logical Rectangle
  - Logical Size
- Unit Measurement
  - Unit Border
  - Unit Line Length
  - Unit Point
  - Unit Range
  - Unit Rectangle
  - Unit Size
Node
- AI Assistant
- Batch Object
  - Batch Folder
    - Batch
  - Batch Page
- Batch Process
- Batch Process Step
- CMIS Connection
- CMIS Repository
- Content Type
  - Content Category
  - Content Model
  - Document Type
  - Form Type
  - Page Type
- Control Sheet
- Data Connection
- Data Element
  - Data Field
    - Data Column
  - Data Field Container
    - Data Element Container
      - Data Model
      - Data Section
    - Data Table
- Data Rule
- Extractor Node
  - Data Type
  - Field Class
  - Value Reader
- File Store
- Folder
  - Batches Folder
  - Local Resources Folder
  - Machines
  - Projects Folder
- IP Element
  - IP Element Container
    - IP Group
    - IP Profile
  - IP Step
- Lexicon
- Machine
- Object Library
- OCR Profile
- Project
- Resource File
- Root
- Scanner Profile
- Separation Profile
- Training Page
- Work Queue
  - Processing Queue
  - Review Queue
OCR Engine
- Azure OCR
- Layered OCR
- Tesseract OCR
- Transym OCR Engine
  - Transym OCR 4
  - Transym OCR 5
Property Converter
- Auto Deskew - Precision Converter
- Base Multi Culture Converter
  - Multi Culture Converter
  - Multi Language Converter
    - Translate - Source Languages Converter
    - Transym OCR 5 - Transym Language Converter
- Blank Zero Converter
- Check List Converter
  - AppXtender - Extended Property Converter
  - CMIS Type Reference - Secondary Types Converter
  - CMISQL Query - Joins Converter
  - CMISQL Query - Select Elements Converter
  - Tesseract OCR - Special Fonts Converter
- CMIS Export Map - Column Converter
- CMIS Folder Reference - Converter
- CMIS Import Map - Field Converter
- CMIS Object - Choice Converter
- Code Expression - Converter
- Collection Converter
  - Behavior - All Fields Converter
  - Content Type - Behaviors Converter
  - Export Format - Collection Converter
  - Field Class - Context Zones Converter
  - LDAP - ACL Converter
  - Review - Command Options Converter
- Column Map - Column Converter
- Content Type - Unlimited Converter
- Degrees Converter
  - Square Angle Converter
- Execute Command - Link Name Converter
- Expandable Converter
  - Base Culture Converter
    - Culture Converter
      - Azure Document Intelligence OCR - Language Converter
      - Azure OCR - Language Converter
      - Language Converter
        - Tesseract OCR - Tess Language Converter
        - Translate - Target Language Converter
    - Culture Converter All
  - Batch Name Settings - Converter
  - Border - Converter
  - Choice Converter
    - Activity Processing - Queue Converter
    - AI Chat Filter - Index Converter
    - AI Search - Api Version Converter
    - Apply Image Cleanup - Ip Profile Converter
    - Azure OCR - Api Version Converter
    - Azure OCR - Model Version Converter
    - Barcode Extractor - Output Group Converter
    - Base Combining Provider - Group Name Converter
    - Batch - Step Converter
    - Batch Process - Queue Converter
    - Batch Process Step - Processing Scope Converter
    - Batch Process Step - Queue Converter
    - Batch Transfer - Process Converter
    - Batch Transfer - Repository Converter
    - Batch Transfer - Step Converter
    - Build Fine Tuning File - Fill Method Converter
    - Chat Filter - User Id Converter
    - Chat Filter - User Name Converter
    - Classify - Classification Level Converter
    - Classify - Output Level Converter
    - CMIS Export - Creatable Child Type Converter
    - CMIS Export - Creatable Folder Converter
    - CMIS Type Reference - Cmis Type Converter
    - CMISQL Element - Orderable Property Converter
    - CMISQL Element - Queryable Property Converter
    - CMISQL Element - Selectable Property Converter
    - CMISQL Query - Primary Type Converter
    - ColorTrac Scanner - Resolution Converter
    - Comparison Filter - Function Name Converter
    - Comparison Filter - Operand Type Converter
    - Comparison Filter - Value Type Converter
    - Comparison Predicate - Comp Op Converter
    - Comparison Predicate - Value Converter
    - Data Element - Display Label Converter
    - Data Field - Sub Element Converter
    - Database Table - Extended Property Name Converter
    - Database Table - Table Name Converter
    - EDI Schema Importer - X12Schema Converter
    - Fill - Fill Method Converter
    - Fill Data - Name Converter
    - Fill Descendants - Name Converter
    - Flag Item - Flag Reason Converter
    - Generate Local Type - Doc Type Property Converter
    - Import Provider - Disposition Converter
    - Import Repository - Repository Converter
    - ISIS Device - Device Name Converter
    - Join Clause - Secondary Type Converter
    - Label Info - Parent Label Converter
    - Lexicon Lookup - Lookup Field Converter
    - Lexicon Lookup - Target Field Converter
    - Nested Table - Table Converter
    - ODBC - Pg Odbc Dsn Converter
    - Pattern-Based - Group Name Converter
    - PDF Data Mapping - Font Name Converter
    - Predicate List - Logical Operator Converter
    - Read Metadata - Property Name Converter
    - Reference - Group Name Converter
    - Regular Expression - Group Converter
    - Remove From Index - Index Name Converter
    - Remove Overrides - Property Name Converter
    - Reset - Step Converter
    - Root - License Url Converter
    - Route Definition - Process Converter
    - Run Step - Step Converter
    - Schema Mapping - Schema Name Converter
    - Search Index - Index Name Converter
    - Search Index Query - Index Name Converter
    - Send To Test Batch - Flag Reason Converter
    - Set Field Value - Value Converter
    - String - Pdf Font Name Converter
    - Task Filter - Activity Name Converter
    - Task Filter - Process Name Converter
    - Task Filter - Queue Converter
    - Task Filter - Step Name Converter
    - Text Document - Encoding Converter
    - Text Document - Normalize Encoding Converter
    - TWAIN Device - Compression Mode Converter
    - TWAIN Device - Device Name Converter
    - Update Process - Process Converter
    - Update Process - Step Converter
    - Value Selector - Target Field Converter
    - XML Value Selector - Target Field Converter
  - Double Range - Double Range Converter
  - Expandable Info Converter
  - Integer Range - Integer Range Converter
  - Logical Border - Arrow Converter
  - Logical Border - Logical Border Converter
  - Logical Point - Logical Point Converter
  - Logical Rectangle - Logical Rectangle Converter
    - Logical Rectangle - Simple Rectangle Converter
  - Logical Size - Logical Size Converter
  - On Off Converter
    - Override Converter
    - Verbose On Off Converter
  - Percent Range - Percent Range Converter
  - Point ExF - Converter
  - Rectangle - Converter
  - Type Selector
    - CMISQL Query - Where Element Converter
    - Data Connection - Connection Converter
    - Exchange - Auth Method Converter
    - Execute - Command Converter
    - Execute Activity - Activity Converter
    - Execute Command - Command Converter
    - Run Activity - Activity Converter
    - SharePoint - Auth Method Converter
    - Storage Type - Converter
    - Web Service Lookup - Auth Method Converter
  - Unit Border - Unit Border Converter
  - Unit Line Length - Unit Line Length Converter
  - Unit Point - Unit Point Converter
  - Unit Range - Unit Range Converter
  - Unit Rectangle - Converter
  - Unit Size - Unit Size Converter
  - Value Extractor - Converter
- JPEG 2000 - Ratio Converter
- Logical Value - Simple Value Converter
- Logical Value - Universal Value Converter
- Node Information - Value Converter
- Page Filter Converter
  - Line Filter Converter
- Path Expression - Converter
- Percent Converter
- Pg Dictionary Converter
- Pg Flags Converter
  - Storage Type Numeric - Input Styles Converter
- Pg Ref Collection Converter
- Pg String Collection Converter
  - Batch Filter - Filter Converter
- Pg Type Display Name Converter
  - Add Multiple Items - Item Type Converter
  - Computed Field - Field Type Converter
  - Node Query - Node Type Converter
  - Read Metadata - Source Converter
  - Variable Definition - Variable Type Converter
- Product License - Quantity Converter
- Read Only Converter
- Rectangle - Inches Converter
- Simple Converter
  - Click to Edit Converter
  - Data Action - Source Element Converter
  - Data Action - Target Element Converter
  - IN Predicate - In Predicate Values Converter
  - OAuth Authentication - Login Converter
  - Pattern Match - Group Options Converter
  - Pg Format Converter
  - Product License - Quantity Used Converter
  - Project - Projects Converter
  - Publish To Repository - Repository Converter
  - Result Set Options - Sort Order Converter
  - Review - View List Converter
  - Stats Query - Name List Converter
  - String - Pg Text Lines Converter
  - Type Permissions - Command Converter
  - Word Match - Term Options Converter
- Text Rendering - Size Converter
- Time Range Converter
- Time Ranges Converter
- Time Span Converter HMS
- Timer Service - Time Converter
- Times Converter
- Unit Value - Unit Value Converter
- Word Match - Integer Range Converter
Property Editor
- Anchor Definition - Location Editor
- Barcode Detected - Preview Image Editor
- Choice Property Editor
  - Azure Document Intelligence OCR - Model Editor
  - Base Culture Editor
    - Culture Editor
      - Multi Culture Editor
    - Culture Editor All
    - Language Editor
      - Multi Language Editor
        - Translate - Source Languages Editor
        - Transym OCR 5 - Transym Language Editor
      - Tesseract OCR - Tess Language Editor
      - Translate - Target Language Editor
  - Check List Editor
    - AI Assistant - Search Index Editor
    - AI Table Reader - Included Columns Editor
    - Batch Filter - Activity Editor
    - Batch Filter - Process Editor
    - Batch Filter - Status Editor
    - Batch Filter - Step Editor
    - CMIS Type Reference - Secondary Types Editor
    - Data Fill Method - Included Children Editor
    - Delete Fine Tuned Model - Models Editor
    - Generate Local Type - Property Check List
    - IMAP - Folder Editor
    - Publish To Repository - Repository Editor
    - Rebuild Indexes - Table Names Editor
    - Reset - Step Checklist Editor
    - Stats Query - Name List Editor
      - Stats Query - Activity Names Editor
      - Stats Query - Machine Names Editor
      - Stats Query - Process Names Editor
      - Stats Query - Stat Names Editor
      - Stats Query - Step Names Editor
      - Stats Query - User Names Editor
    - Table Mapping - Column Check List
    - Text Analysis - Entity Type Editor
    - Type Permissions - Command Editor
  - Data Connection - Table Name Editor
  - Delete Fine Tuned Model - Model Editor
  - GPT Embed - Embeddings Model Editor
  - LLM Connector - Chat Model Editor
  - LLM Connector - Embeddings Model Editor
  - Return Value - Column Editor
  - SQL Server - Database Name Editor
  - Start Fine Tuning Job - Model Editor
- CMIS Compound Type - Editor
- CMISQL Query - Query Editor
  - Import Descendants - Filter Editor
- Code Property Editor
  - AI Chat Filter - Filter Editor
  - AI Schema Extractor - Schema Editor
  - Ask AI - Schema Editor
  - Code Expression Editor
    - Batch Process Step - Next Step Editor
    - Batch Process Step - Should Submit Editor
    - Calculate Value - Value Expression Editor
    - CMIS Export Map - Expression Editor
    - CMIS Import Map - Expression Editor
    - Code Expression - Editor
    - Column Map - Expression Editor
    - Computed Field - Expression Editor
    - Concat - Trigger Editor
    - Content Type - Caption Editor
    - Copy Base - Trigger Editor
    - Custom Statement - Statement Editor
    - Data Export - Alternate Database Editor
    - Data Field - Default Value Editor
    - Data Field - Field Expression Editor
      - Data Field - Calculate Editor
      - Data Field - Required Editor
      - Data Field - Validate Editor
      - Data Field - Validate Message Editor
    - Data Rule - Trigger Editor
    - Data Section - Caption Editor
    - Expression Set - Default Value Editor
    - Expression Set - Field Expression Editor
      - Expression Set - Calculate Editor
      - Expression Set - Required Editor
      - Expression Set - Validate Editor
      - Expression Set - Validate Message Editor
    - IP Element - Next Step Editor
    - IP Element - Should Execute Editor
    - Lookup Specification - Trigger Editor
    - Metadata Options - Value Editor
    - Path Expression - Editor
    - Raise Issue - Log Message Editor
    - Remove - Trigger Editor
    - Require Value - Log Message Editor
    - Text Transform - Record Editor
    - Variable Definition - Expression Editor
  - Create Table - Statement Editor
  - Data Field Container - Css Editor
  - Database Lookup - SQL Query Editor
  - Embedded Lexicon - Local Entries Editor
  - Json Property Editor
  - KVP Editor
  - Lexicon - Lexicon Link Code Editor
  - List Match - Local Entries Editor
  - Mail Import - IMAP Query Editor
  - Pattern Match - Output Format Editor
  - Regex Property Editor
    - Parse Value - Pattern Editor
    - Pattern-Based - Pattern Editor
    - Text Match - Reg Ex Editor
  - Search Classifier - Filter Editor
  - Search Index - Filter Editor
  - Search Index Query - Filter Editor
  - Search Index Query - Order By Editor
  - Search Index Query - Search Editor
  - Send Mail - Template Editor
  - String List Editor
  - Submit Indexing Job - Select Editor
  - Subset Filter - Filter Editor
  - Text Property Editor
    - Node Description Editor
  - Web Service - Header Editor
  - Web Service Lookup - Post Data Editor
  - Web Service Lookup - Url Editor
  - Word Match - Output Format Editor
  - XML Lookup - Selector Editor
  - XML Transform - Transform Editor
  - XML Value Selector - Path Editor
- Folder Browse Editor
  - CMIS Folder Reference - Editor
  - File Directory Editor
  - FTP Export - Ftp Folder Editor
  - Mail Export - Mail Folder Editor
  - SFTP Export - Ssh Folder Editor
- LDAP - ACL Editor
- OAuth Authentication - Login Editor
- Object Collection Editor
  - Content Type - Behavior Collection Editor
  - License Package - License Collection Editor
  - Pattern Match - Group Options Editor
  - Permission Set - Type Perms Editor
  - Predicate List - Predicate Collection Editor
  - Review - Command Options Editor
  - Root - Options Editor
  - Word Match - Term Options Editor
- Object Properties Editor
  - Data Type - Collation Editor
  - IP Step - Command Editor
- Open File Editor
  - Import Device - Zip File Editor
- Reference Editor Base
  - Node Reference Editor
    - Archive - Folder Editor
    - Batch Process Step Editor
    - Content Type Editor
      - Child Type Editor
      - Content Model - Child Content Type Editor
      - Content Scope Editor
      - Content Type - Parent Type Editor
    - Custom Statement - Scope Editor
    - Data Action - Action Element Editor
    - Data Action - Source Editor
      - Data Action - Source Container Editor
      - Data Action - Source Element Editor
      - Data Action - Source Field Editor
    - Data Action - Target Editor
      - Data Action - Target Collection Editor
      - Data Action - Target Container Editor
      - Data Action - Target Element Editor
        - Concat - Target Collection Editor
        - Remove - Target Collection Editor
      - Data Action - Target Field Editor
    - Data Field Container - Rule Editor
    - Data Rule - Scope Editor
    - Database Cleanup - Folder Editor
    - Dispose Batch - Target Folder Editor
    - Execute Rule - Rule Editor
    - Field Match - Field Editor
    - Generate Subsets - Field Editor
    - Grid Layout - Header Column Editor
    - Piece Info Options - Key Column Editor
    - Piece Info Options - Value Column Editor
    - Return Value - Field Editor
    - Set Field Value - Field Editor
    - Table Mapping - Scope Editor
    - Task Filter - Batch Editor
    - Test Batch Editor
    - Text Transform - Scope Editor
    - Train Lexicon - Scope Editor
    - Virtual Table Definition - Collection Editor
    - Web Service - Definition File Editor
  - Ordered Reference Editor
    - Generate Control Sheets - Document Types Editor
    - Virtual Table Definition - Columns Editor
  - Reference List Editor
    - All Nodes Reference Editor
    - Behavior - Field List Editor
      - Field Annotation - Field Annotation Editor
    - Bookmark Options - Data Element Editor
    - Build Fine Tuning File - Batch Editor
    - Content Types Editor
      - Child Types Editor
    - Correct - Fields Editor
    - Data Fill Method - Included Descendants Editor
    - Data Model - Style Sheets Editor
    - Data Rule - Required Elements Editor
    - Data Values - Included Elements Editor
    - Extract - Data Element Filter Editor
    - Indexing Behavior - Included Elements Editor
    - JSON Data Mapping - Included Elements Editor
    - Lexicon - Lexicons Editor
    - Piece Info Options - Element Editor
    - Project - Projects Editor
    - Redact - Extractors Editor
    - Redact - Fields Editor
    - Require Value - Required Elements Editor
    - Section Extract Method - Included Descendants Editor
    - Thumbnail View - IP Profiles Editor
    - Transaction Detection - Field List Editor
- Sample Image Collection - Editor
- Value Extractor - Editor
- Zone Editor
Schema Importer
- AI Generated
- CMIS Schema Importer
- Database Schema Importer
- EDI Schema Importer
- XML Schema Importer
Section Extract Method
- AI Section Reader
  - AI Collection Reader
- AI Transaction Detection
- Clause Detection
- Divider
- Fixed
- Full Page
- Geometric
- Nested Table
- Simple
- Transaction Detection
Separation Provider
- AI Separate
- ESP Separator
  - ESP Auto Separation
- Extractor Based Provider
  - Change In Value Separator
  - EPI Separation
  - Pattern-Based Separation
- Multi Separator
- Real Time Provider
  - Control Sheet Separation
    - Event-Based
- Undo Separation
Service Instance
- Activity Processing
- API Services
- Import Watcher
- Indexing Service
- System Maintenance Service
- Timer Service
- Web Service
  - Grooper Licensing
Storage Type
- Boolean
- Custom
- GUID
- Storage Type Ranged
  - DateTime
  - Storage Type Numeric
    - Decimal
    - Double
    - Int16
    - Int32
    - Int64
- String
- URL
Table Extract Method
- AI Table Reader
- Delimited Extract
- Fixed Width
- Fluid Layout
- Grid Layout
- Row Match
- Tabular Layout
Task View
- Data View
- Fiche Strip View
- Folder View
  - Classification View
  - Scan View
- Separation View
- Thumbnail View
UI Element
- Control
  - Active Task List
  - AI Helper
  - Batch Info Tab
    - Batch Details Viewer
    - Batch Events Viewer
    - Batch History Viewer
    - Batch Stats Viewer
    - Task Chart
  - Batch Info Viewer
  - Batch List
  - Batch Manager
  - Candidate List
  - Card List
  - Chat Console
  - Class Help
  - CMIS Repository Searcher
  - CMIS Tree Browser
  - CMIS Type Tree
  - Code Editor
  - Complete List
  - Content Viewer
    - HTML Viewer
    - Mail Viewer
    - NDJSON Editor
    - Null Viewer
    - Page Viewer
    - Text Editor
    - ZIP Viewer
  - Context Menu
  - Conversation Viewer
  - Data Element Tester
  - Data Grid
  - Data Grid Document
  - Data Grid Element
    - Data Grid Collection
    - Data Grid Container
    - Data Grid Field
    - Data Grid Table
    - Virtual Table
  - Data Inspector
  - Data Tree
  - Design Tab
    - AI Assistant - Chat History
    - Batch
      - Batch - General
      - Batch - Viewer
    - Batch Folder - General
    - Batch Page - General
    - Batch Process
      - Batch Process - Batches
      - Batch Process - General
    - Batch Process Step
      - Batch Process Step - General
      - Batch Process Step - Testing Tab
        - Batch Process Step - Activity Tester
        - Batch Process Step - Classification Tester
        - Batch Process Step - ESP Separation Tester
        - Batch Process Step - Recognition Tester
        - Batch Process Step - Redaction Tester
        - Batch Process Step - XSLT Editor
    - CMIS Connection - General
    - CMIS Repository
      - CMIS Repository - Browse
      - CMIS Repository - Search
      - CMIS Repository - Types
    - Content Type
      - Content Type - Documents
      - Content Type - Labels
      - Content Type - Overrides
      - Content Type - Training Samples
      - Content Type - Weightings
    - Control Sheet - General
    - Data Connection - General
    - Data Element
      - Data Element - General
      - Data Element - Tester
    - Data Rule - Tester
    - Extractor Node - Tester
    - Field Class - Weightings
    - Folder - Batches
    - IP Element Container - Tester
    - IP Step - Tester
    - Lexicon - General
    - Machines
      - Machines - General
      - Machines - Services
    - Node
      - Node - Advanced
      - Node - General
      - Node - Reports
      - Node - Scripting
    - OCR Profile - Tester
    - Processing Queue - Workers
    - Project - Usage
    - Resource File - General
    - Root
      - Root - Events
      - Root - Licensing
      - Root - Scripts
    - Training Page - General
  - Design Tab Host
  - Diagnostics Viewer
  - Document Searcher
  - Document Viewer
  - Expression Grid
  - Extractor Builder
  - FRX Grid
  - FRX Visualizer
  - Image Editor
  - Image Print Preview
  - Image Viewer
  - Instance Searcher
  - Label Set Editor
  - List Searcher
  - Lookup Fields
  - Lookup Results
  - Node Finder
  - Node Report
    - Content Type Report
      - Circular Expressions
      - Data Elements
      - Derived Types
      - Expressions
      - Property Overrides
      - Validation Rules
    - Descendants
  - Object List
    - Candidate Type List
    - CMIS Results List
    - Data Row List
    - Document List
    - Instance Result Set
    - Node List
    - Reflection List
      - CMIS Object List
      - Instance List
      - Principal List
    - Search Result List
    - String List
    - Table Info List
  - OCR Viewer
  - Page Navigator
  - Profile Browser
  - Property Grid
  - Property Grid Editor
    - ACL Editor
    - Anchor Editor
    - Choice Editor
    - CMIS Query Editor
    - Code Property Editor
    - Collation Editor
    - Collection Editor
    - Extractor Property Editor
    - Folder Editor
    - List Editor
    - Multi Reference Editor
    - OAuth Log-in Editor
    - Object Editor
    - Ordered Reference Editor
    - Preview Image Editor
    - Reference Editor
    - Sample Image Editor
    - Zone Editor
  - Property Help
  - Query Editor
  - Query Helper
  - Query List
  - Recognition Tester
  - Rep Info Panel
  - Review Tab
    - Batch Viewer
    - Classify Viewer
    - Data Viewer
    - Scan Viewer
    - Separation Viewer
    - Thumbnail Viewer
  - Search Result Cards
  - Separation List
  - Service Collection
  - Splitter
  - Stats Report
  - Stats Result Set
  - Stats Viewer
  - Tab List
  - Task List
  - Test Source
  - Tree Viewer
    - Editor Tree
    - Override Tree
  - Upload Dialog
  - Weightings List
- Web Page
  - Batches Page
  - Chat Page
  - Design Page
  - Help Page
  - Home Page
  - Imports Page
  - Jobs Page
  - Review Page
  - Search Page
  - Stats Page
  - Tasks Page
Value Extractor
- AI Column Extractor
- AI Schema Extractor
- Ask AI
- Barcode Extractor
  - Find Barcode
  - Read Barcode
- Detect Signature
- Highlight Zone
- Labeled Value
- OMR Extractor
  - Labeled OMR
  - Ordered OMR
  - Zonal OMR
- Query HTML
- Query XML
- Read Metadata
- Read Zone
- Reference
- Select Page
- Text Analysis
  - Entity Recognition
  - Key Phrase Extraction
  - Pii Entity Recognition
- Text Match
  - Field Match
  - List Match
    - Label Match
  - Pattern Match
  - Word Match
Variable Provider
- Alpha Provider
- Culture Info Provider
  - Currency Decimal Digits
  - Currency Decimal Separators
  - Currency Group Digits
  - Currency Group Separators
  - Currency Labels
  - Currency Symbols
  - Day Names
  - Day Names Abbreviated
  - Day Names Shortest
  - Digits
  - Letters
  - Letters Lower
  - Letters Upper
  - Month Names
  - Month Names Abbreviated
  - Month Names Genetive
- Expression Lexicon Provider
- Extractor Variable Provider
- Field Value List Provider
- Field Variable
- Group Vocabulary Provider
- Number Names Provider
- Number Provider
- Referenced Lexicon Provider
- Vocabulary
Other Configuration Types
- API Key
- Archive Info
- Border
- Capture Settings
- Character Class Filter
- Chat Parameters
- CMIS Object
  - CMIS Document
  - CMIS Folder
- CMIS Property Definition
  - CMIS Boolean Property Definition
  - CMIS DateTime Property Definition
  - CMIS Decimal Property Definition
  - CMIS HTML Property Definition
  - CMIS ID Property Definition
  - CMIS Integer Property Definition
  - CMIS String Property Definition
  - CMIS URI Property Definition
- Code39Settings
- Connected Object
  - Batch Filter
  - Chat Filter
  - Database Row
    - AI Chat
    - AI Message
    - Doc Index
    - File Store Entry
    - Import Job
    - Index State
    - Index Table
      - Batch State
    - Log Event
    - Processing Job
    - Processing Task
    - Saved Query
    - Session Stats
  - Embedded Object
    - AI Chat Filter
    - AI Chat Settings
    - AI Generator
    - Anchor Definition
    - Attachment Rule
    - Auto Complete Settings
    - Barcode Reader
      - 1D Reader
      - 2D Reader
      - Postcode Reader
      - Standard Reader
    - Batch Creation Settings
    - Batch Name Settings
    - Bookmark Options
    - Bot Connector
    - Boundary Detector
    - Chunk Settings
    - Cluster Parameters
    - CMIS Export Map
    - CMIS Folder Reference
    - CMIS Import Map
    - CMIS Type Definition
    - CMIS Type Reference
      - CMIS Compound Type
    - Code Expression
      - Boolean Expression
      - String Expression
    - Column Map
    - Command Options
    - Computed Field
    - Content Mapping
    - Custom Statement
    - Data Element Extension
      - AI Extract Field Options
      - AI Extract Section Options
      - AI Extract Table Options
      - Grid Layout Options
      - Tabular Layout Options
    - Data Element Profile
    - Data Fill Method
      - AI Extract
      - Fill Descendants
      - Run Child Extractors
    - Data Generator
    - Edge Adjustment
      - Absolute
      - Anchor
      - Edge of Page
      - Relative
    - Embedded Lexicon
      - Field Value Lexicon
      - Fuzzy Match Weightings
      - List Match Entries
    - Environment Options
    - Execute Step
      - Execute Activity
      - Execute Command
    - Expression Set
    - Field Annotation
      - Field Widget Annotation
        
        Checkbox Widget
        
        Radio Group Widget
        
        Signature Widget
        
        Textbox Widget
      - Highlight Annotation
      - Text Annotation
    - Field Mapping
    - File Reference
      - Resource File Reference
      - UNC File Reference
      - URL File Reference
    - Folder Level Info
    - FRX Options
    - FTP Repository Configuration
    - Fuzzy Lookup Options
    - Horizontal Tab Marker
    - HTTP Auth Method
      - Basic
      - OAuth Client Credentials
    - HTTP Resource
    - Hyperlink Selector
    - Image Segmentation Options
    - Import Schedule
      - Polling Loop
      - Specific Times
    - Index Stats
    - Label Info
    - Label Set
    - Label Version
    - Layout Provider
      - Flow
      - Horizontal
      - Vertical
    - Line Periodicity Detector
    - LLM Provider
      - Azure Provider
      - GCS Provider
      - Open AI Provider
    - Lucene Query
      - Lucene Group
      - Lucene Phrase
      - Lucene Word
    - Metadata Options
    - Multiline Row Settings
    - OCR Layer
    - OCR Repair Options
      - Spell Corrector
    - OMR Box
    - Page Attachment Rule
    - Paragraph Marker
    - Path Expression
    - PDF Expand Method
      - Bookmarks
      - Fixed Page Count
      - Page Piece
      - Tag Based
    - Permission Set
    - Piece Info Options
    - Quoting Method
      - Data Values
      - Extracted
      - Labeled Region
      - Layout Objects
      - Multi Quote
      - Semantic
    - Region Definition
      - Dynamic Region
        
        Shape Region
        
        Text Region
      - Fixed Region
        
        Relative Region
    - Repository Configuration
    - Repository Option
      - AI Search
      - LLM Connector
      - Text Analysis Option
    - Resource Reference
      - Bing Search
      - Database Table
      - Search Index
      - Web Service
    - Result Filter
    - Result Processor
      - OCR Reader
      - OMR Reader
      - Place Zone
    - Result Set Options
    - Return Value
    - Route Definition
    - Sample Image Collection
    - Schema Mapping
    - Search Filter
      - Boolean Filter
      - Field Filter
        
        Comparison Filter
        
        In Filter
      - Is Match Filter
      - Lambda Filter
    - Separate Action
      - Separation Event
        
        Barcode Detected
        
        Blank Page Detected
        
        Content Type Detected
        
        Page Count
        
        Shape Detected
    - Service Deployment
      - Chat Service
      - Embeddings Service
      - Fine Tuning Service
    - Service Stats
    - Stats Query
    - Subset Filter
    - Table Header Detector
    - Table Mapping
    - Table Row Detector
    - Text Preprocessor
    - Transaction Extractor
    - Type Permissions
    - Value Lookup
      - Group Options
    - Value Selector
    - Variable Definition
    - Vector Search Options
    - Vertical Tab Marker
    - Virtual Table Definition
    - XML Value Selector
  - Node Query
  - Purge Folder
  - Search Index Query
  - Task Filter
    - Attended Task Filter
    - Unattended Task Filter
- Constrained Wrap Options
- Culture Data
- Dash Detector
- Database Connection Settings
  - ODBC
  - SQL Server
    - Repository Connection
- Defect Generator
  - Border Generator
  - Image Scaler
  - Image Skewer
  - Image Translator
  - Noise Generator
- Double Range
- Dropout Method
  - Fill
  - Inpaint
- Event Filter
- Fiche Card Layout
- Folder Level Options
- Horizontal Alignment Settings
- HTTP Authentication Method
  - Anonymous Authentication
  - Auto Authentication
  - Basic Authentication
  - NTLM Authentication
  - OAuth Authentication
    - Azure OAuth
      - Exchange OAuth
      - OneDrive OAuth
      - SharePoint OAuth
  - OAuth Service Login
- Image Compression
  - JPEG
  - JPEG 2000
- Image Info
- Integer Range
- Line Snap Options
  - Result Snap Options
- Margin Detector
- Multi Line Settings
- Node Information
- PDF Burst Settings
- PDF Page Generator
- PDF Render Settings
- Percent Range
- Rectangle
- Region Detector
- Regular Expression
  - Attribute Rule
  - Wrap Rule
- Remote Repository
- Row Alignment Settings
- Scan Once Settings
- Semantic Quoting Query
- SFTP Repository Configuration
- Shell Execute Info
- Sort Specification
- System Config
- Text Wrap Options
- TIFF Page
- Transaction Layout Detection
- Vertical Wrap Detection
Advanced Topics
- CSS Builder
- Data Model Compiler
- Data Model Expression Builder
- Expression Builder
- Fuzzy Regular Expression
- Layout Data
- Real Time Image Processor
- Retrieval Plan
- Single - Fuzzy Match Cost Map
- Task Processor
Enumerations
- Grooper
  - CharacterCasing
  - ConcurrencyMode
  - DatabaseStatus
  - DCTModes
  - EventType
  - NodeAttributes
  - Pages
  - PixelFormat
  - ProcessingScope
  - ProcessingStatus
  - ResultOrder
  - SimplePixelFormat
- Grooper.Activities
  - ActionType
  - BatchDisposition
  - BatchNameSuffixEnum
  - BodyRenderingMethod
  - ComparisonMode
  - DuplicateDisposition
  - ExecuteType
  - ExecutionScope
  - ExtractMode
  - FilterType
  - MatchActions
  - OcrAssistMode
  - PageExtractMode
  - ProblemDisposition
  - ReclassifyModes
  - RepairScope
  - RouteMethod
  - SaveDisposition
  - SharedBehaviorModes
  - SpawnMethod
  - StatsLoggingMode
  - TextExtractMode
  - TrainingScope
  - XmlSource
  - XmlTarget
- Grooper.Capture
  - FeedOrientation
  - ImportType
  - MissDispositionEnum
  - PageDirection
  - ScanningSpeed
  - TwainCompressionModes
- Grooper.Capture.ColorTrac
  - ColorFormat
  - PageSizeMode
  - PaperEndCondition
  - PaperJustification
  - ScanSpeed
  - StandardPageSize
- Grooper.Cloud
  - ApiRegionEnum
  - ContentLayout
  - HttpVerbs
  - MessageFormats
  - MetadataModes
  - TranslateDisposition
- Grooper.CMIS
  - AuthenticationProvider
  - CmisProtocol
  - ImportModes
  - LoadScope
  - NamingMethods
  - OrderByDirection
  - TransferScopes
- Grooper.Core
  - ActivateModes
  - ArrayActions
  - AttachmentPosition
  - BrowserSuggestMode
  - CalculateModes
  - CalculateModes
  - CaptureScope
  - ClassificationLevel
  - CompareMode
  - ConflictDispositions
  - ConflictResolution
  - ControlCharacters
  - CreateModes
  - DedupMode
  - DispositionType
  - DuplicateFilenameResolution
  - FolderRelativePosition
  - FolderRelativePosition
  - FooterModes
  - FormatOptions
  - FuzzyMatchMode
  - GroupingColumn
  - IdfModes
  - IssueDisposition
  - JsonLayout
  - LexiconType
  - MergeModes
  - MissDispositions
  - MissDispositions
  - NumberFormats
  - OxiElement
  - PaginationType
  - ParagraphOptions
  - PdfBuildOptions
  - PopulationMethod
  - ProcessingLevel
  - PropagationMode
  - SegmentType
  - SortColumns
  - SortDirection
  - SortDirections
  - SortDirections
  - SortOption
  - SortOrder
  - StandardWeightings
  - TabOptions
  - TaskScope
  - TfModes
  - TimeFrames
  - TimeGrouping
  - TrainingScopes
  - TriggerModes
  - TypeKind
  - TypeModes
  - TypeOperation
  - UserTrainingMode
  - ValueInterpretations
  - ZIPDispositions
- Grooper.EDI
  - AttachmentNamingMethods
  - DataDisposition
  - NamingMethods
  - NamingMethods
- Grooper.Extract
  - AdjustmentMethod
  - AlignmentMode
  - CollationType
  - CombineType
  - CompassDirection
  - ConfidenceModes
  - ContextScopes
  - CultureScopes
  - ExecutionScope
  - FlowDirection
  - GroupingType
  - HorizontalDataAlignment
  - HorizontalDataAlignment
  - LabelLayout
  - LookupOption
  - MappingType
  - OmrBoxDirection
  - OmrFlowDirection
  - OmrMode
  - OutputValueOptions
  - ReadDirection
  - ReadMethods
  - ReferencePointPosition
  - ROIModes
  - RowDetectionMode
  - RowMatchOptions
  - SecondaryExtractMethod
  - SecondaryExtractTrigger
  - SplitPositionEnum
  - TableRowAlignment
  - TableStyles
  - VerticalDataAlignment
  - WordTransform
- Grooper.GPT
  - AuthorizationMethod
  - BooleanOperator
  - BuiltInFieldKinds
  - DocumentLinkingOptions
  - FieldAlignMode
  - IndexOperations
  - LambdaFunction
  - LayoutComponentTypes
  - OperationType
  - QueryTypes
  - ReasoningEffortLevels
  - ResultOrder
  - RetrievalOptions
  - RowAlignMode
  - SearchModes
  - SectionAlignMode
  - ServiceTiers
  - VerbosityLevels
- Grooper.IP
  - AdaptiveKernelType
  - AngleCategory
  - Axis
  - BinarizationMethod
  - ChannelNumber
  - Code39Options
  - ColorSpaceType
  - CombDetectionType
  - CompressionMode
  - Connectivity
  - CropMethod
  - CurveType
  - DetectMethod
  - FeatureType
  - FillMethod
  - FilteringLevel
  - FilterTypeEnum
  - HarrisFilterType
  - HoughLevel
  - ImageEdges
  - InpaintMethod
  - MaskShape
  - MaskSize
  - MeasurementType
  - Method
  - OneDimSymbology
  - OperationType
  - OperationType
  - Pdf417Options
  - PostSymbology
  - ProcessingResolution
  - ProgressionOrder
  - ReadDirection
  - ReadingQuality
  - ResizeInterpolationMode
  - SizeMethod
  - Symbology
  - TwoDimSymbology
  - WarpInterpolationMode
- Grooper.Messaging
  - BodyHandling
  - Orientation
  - PaperKind
  - SaveAction
  - SelectorKind
- Grooper.OCR
  - AccuracyLevels
  - BaseCharacterSetEnum
  - DetectionMethod
  - EngineModeEnum
  - FontPitchMode
  - LexMode
  - PageOrientation
  - PageOrientation
  - SegmentationModeEnum
  - SynthesisMethodEnum
- Grooper.Office
  - SaveMethod
- Grooper.PDF
  - CompressionMode
  - ImageLayout
  - PDFAComplianceLevels
  - PdfBorderStyle
  - PdfDisplayMode
  - PdfPermissions
  - PdfViewerOptions
  - SearchableTextFormat
  - TargetColorFormat
- Grooper.Services
  - DaysOfWeek
- Grooper.Services.CMIS
  - ConnectMethod
  - ContentMode
  - FileType
  - FormOverlayType
  - MergeAction
- Miscellaneous
  - BaseTypeId
  - CharacterCasing
  - CompressionLevel
  - ContentAlignment
  - DateTimeStyles
  - FileAttributes
  - FontStyle
  - Formatting
  - HorizontalAlignment
  - Keys
  - NumberStyles
  - RegexOptions
  - ThreadPriority
  - UriKind

Text Preprocessor

Inherits From: Embedded Object
Namespace: Grooper.Core

Applies configurable text preprocessing to a document's content before regular expression extraction.

Remarks

The Text Preprocessor enables advanced manipulation of control characters in a document's text, allowing regular expressions to match or ignore structural elements such as line breaks, paragraph boundaries, page breaks, tabs, and spaces.

Overview

Text preprocessing is performed immediately before extraction, transforming the document's text to improve the accuracy and flexibility of pattern matching. This is especially useful when data values span multiple lines, are separated by large whitespace gaps, or are affected by inconsistent formatting.

Key Features

Paragraph Marking:
Detects paragraph boundaries and converts line breaks within paragraphs to spaces, while preserving paragraph-ending breaks. This allows extractors to match values that span multiple lines within a paragraph, without matching across paragraph boundaries. See Paragraph Marker.
Tab Marking:
Replaces large horizontal whitespace gaps with TAB characters, making it possible to distinguish between normal spaces and significant gaps in regular expressions. See Horizontal Tab Marker.
Vertical Tab Marking:
Converts certain line breaks to vertical tab characters based on vertical spacing, enabling recognition of vertical structure in tabular or multi-column layouts. See Vertical Tab Marker.
Control Character Ignoring:
Removes or replaces selected control characters (such as spaces, newlines, form feeds, and carriage returns) according to the 'Ignore Control Characters' setting. This can simplify extraction in documents with inconsistent or excessive whitespace.

Usage Guidance

Configure the desired preprocessing options by enabling or disabling paragraph, tab, and vertical tab marking, and by selecting which control characters to ignore.
Preprocessing is typically used in conjunction with regular expression-based extractors, but can benefit any extraction scenario where document structure affects pattern matching.
For best results, adjust preprocessing settings to match the structure and formatting of your source documents.

Example Scenarios

Extracting values that span multiple lines within a paragraph:
Enable paragraph marking to convert internal line breaks to spaces, allowing regular expressions to match values split across lines.
Distinguishing between normal spaces and large gaps:
Enable tab marking to insert TAB characters at significant horizontal gaps, so extractors can target fields separated by large whitespace.
Cleaning up unwanted whitespace or control characters:
Use the 'Ignore Control Characters' option to remove or replace problematic characters that interfere with extraction.

For more details, see the documentation for Paragraph Marker, Horizontal Tab Marker, and Vertical Tab Marker.

Examples

1. Sample Document

Consider the following sample document.

┌─────────────────────────────────────────────────────────────┐
│                        SAMPLE FORM                          │
├─────────────────────────────────────────────────────────────┤
│ Name:           John Doe                   ID: 12345        │
│ Date of Birth:  01/01/1980                 Status: Active   │
├─────────────────────────────────────────────────────────────┤
│ This is the first paragraph. It explains the purpose of     │
│ the form and the meaning of each field.                     │
│                                                             │
│ Please complete all fields and verify all personal          │
│ information before submitting. Thank you!                   │ 
└─────────────────────────────────────────────────────────────┘

2. Default Control Characters

With no preprocessing options enabled, the document data will look like this. Whitespace gaps, no matter how large, are represented by a single space character. A \r\n pair marks each location where the original document wrapped to the next line.

SAMPLE FORM\r\n
Name: John Doe ID: 12345\r\n
Date of Birth: 01/01/1980 Status: Active\r\n
This is the first paragraph. It explains the purpose of\r\n
the form and the meaning of each field.\r\n
Please complete all fields and verify all personal\r\n
information before submitting. Thank you!\r\n

3. Preprocessed Version

Preprocessing the document with paragraph marking and tab marking places a tab character '\t' at each large whitespace gap and replaces each newline pair '\r\n' occurring inside a paragraph with a space.

SAMPLE FORM\r\n
Name: John Doe\tID: 12345\r\n
Date of Birth: 01/01/1980\tStatus: Active\r\n
This is the first paragraph. It explains the purpose of the form and the meaning of each field.\r\n
Please complete all fields and verify all personal information before submitting. Thank you!\r\n
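
The effect of these two options on pattern matching can be sketched in a few lines of Python. This is only an illustration using Python's standard `re` module, not Grooper's extraction engine; the helper name `tab_delimited_fields` is hypothetical, and the two strings simply reproduce the default and preprocessed text of the sample form.

```python
import re

# Text of the sample form with no preprocessing options enabled.
default_text = (
    "SAMPLE FORM\r\n"
    "Name: John Doe ID: 12345\r\n"
    "Date of Birth: 01/01/1980 Status: Active\r\n"
    "This is the first paragraph. It explains the purpose of\r\n"
    "the form and the meaning of each field.\r\n"
    "Please complete all fields and verify all personal\r\n"
    "information before submitting. Thank you!\r\n"
)

# Text of the sample form with paragraph marking and tab marking applied.
preprocessed_text = (
    "SAMPLE FORM\r\n"
    "Name: John Doe\tID: 12345\r\n"
    "Date of Birth: 01/01/1980\tStatus: Active\r\n"
    "This is the first paragraph. It explains the purpose of "
    "the form and the meaning of each field.\r\n"
    "Please complete all fields and verify all personal "
    "information before submitting. Thank you!\r\n"
)

# A phrase that wraps across two lines in the original layout.
phrase = re.compile(r"verify all personal information")


def tab_delimited_fields(text):
    """Hypothetical helper: split tab-marked lines into label/value pairs."""
    fields = {}
    for line in text.split("\r\n"):
        if "\t" not in line:
            continue  # only lines containing a marked gap are treated as field rows
        for cell in line.split("\t"):
            label, _, value = cell.partition(": ")
            fields[label] = value
    return fields


phrase_in_default = phrase.search(default_text) is not None
phrase_in_preprocessed = phrase.search(preprocessed_text) is not None
fields = tab_delimited_fields(preprocessed_text)
```

In the default text the phrase is interrupted by a \r\n pair and is not found; in the preprocessed text it matches, and each tab-separated cell can be read as its own field.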

Properties

Name Type Description

Paragraph Marking

Paragraph Marker

Detects and marks paragraph boundaries in natural language documents to improve data extraction from paragraph flow text.

Can be one of the following types:

Value	Description
Enabled
Disabled

The Paragraph Marker is a text preprocessing component used to identify paragraph boundaries in documents, especially those containing natural language text. By marking paragraphs, it enables more accurate extraction of data that may span multiple lines within a paragraph, while preserving true paragraph breaks.

Purpose

Paragraphs in documents often wrap across multiple lines, causing data values to be split by line breaks (CR/LF). This can make it difficult for extractors to match values that span lines, as standard extraction logic may not account for embedded line breaks within paragraphs.

The Paragraph Marker solves this by detecting paragraph boundaries and converting line breaks inside paragraphs to spaces, while leaving the line break at the end of each paragraph intact. This produces a normalized text flow, making it easier to extract values that span lines.

How It Works

The Paragraph Marker processes the text of a document by analyzing each line and determining whether it should be joined with the previous line or treated as the start of a new paragraph.

The main algorithm works as follows:

The text is split into lines.
For each line, a set of rules is evaluated to determine if it is the start of a new paragraph. These rules include:
- Line width (absolute and relative to the widest line)
- Presence of large horizontal or vertical gaps
- Indentation changes
- Custom bullet or pattern matches (using 'Paragraph Break Rule')
- Detection options such as bullets, double spacing, and underlines
If a line is determined to be a paragraph start, the previous paragraph is finalized, and a new paragraph begins.
Line breaks within paragraphs are replaced with spaces, while true paragraph breaks are preserved as CR/LF pairs.

This approach ensures that wrapped lines within a paragraph are merged for extraction, while true paragraph boundaries are maintained for downstream processing.
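
The line-joining idea can be sketched as follows. This is a deliberately simplified illustration, not Grooper's implementation: it uses character counts as a stand-in for measured line width, applies only a relative-width rule and a bullet pattern, and the names `mark_paragraphs`, `wrap_threshold`, and `break_rule` are assumptions made for this example.

```python
import re


def mark_paragraphs(text, wrap_threshold=0.8, break_rule=r"^\s*(?:[-*]|\d+\.)\s"):
    """Join wrapped lines with spaces and keep CR/LF at paragraph ends.

    A line starts a new paragraph when the previous line is noticeably
    shorter than the widest line (it did not wrap), or when the line
    matches the bullet pattern in break_rule.
    """
    lines = [ln for ln in text.split("\r\n") if ln.strip()]
    if not lines:
        return ""
    widest = max(len(ln) for ln in lines)
    bullet = re.compile(break_rule)

    paragraphs = [[lines[0]]]
    for prev, line in zip(lines, lines[1:]):
        prev_is_short = len(prev) < wrap_threshold * widest
        if prev_is_short or bullet.match(line):
            paragraphs.append([line])      # finalize the previous paragraph
        else:
            paragraphs[-1].append(line)    # wrapped line: same paragraph
    return "".join(" ".join(p) + "\r\n" for p in paragraphs)


sample = (
    "This agreement, to be effective February\r\n"
    "1, 1988, is executed on January 15, 1988.\r\n"
    "- First bullet item\r\n"
    "- Second bullet item\r\n"
)
marked = mark_paragraphs(sample)
```

Here the first two lines are merged into one paragraph, while each bullet line remains its own paragraph terminated by a CR/LF pair. The real component also weighs gaps, indentation, and underlines, which this sketch omits.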

Example

Consider the following paragraph, where the effective date is split across two lines:

This agreement, to be effective February
1, 1988, is executed on January 15, 1988.

Without paragraph marking, an extractor searching for "February 1, 1988" would not find a match due to the embedded line break (\r\n) after 'February'. With paragraph marking enabled, the text is normalized as:

This agreement, to be effective February 1, 1988, is executed on January 15, 1988.

Now, extractors can reliably match values that span lines within a paragraph, without overmatching across true paragraph boundaries.

Configuration Guidance

Use the 'Minimum Line Width' and 'Line Wrap Threshold' properties to control how paragraph boundaries are detected based on line length.
Adjust 'Maximum Horizontal Gap' and 'Line Spacing Limit' to fine-tune detection for documents with variable spacing or formatting.
Enable detection options such as bullets, double spacing, or underlines to handle specialized paragraph structures.
Use the 'Paragraph Break Rule' property to define custom logic for identifying the start of new paragraphs, such as custom bullet formats.

Usage Notes

Paragraph Marker is typically used as part of a text preprocessing pipeline before data extraction.
Proper configuration is essential for accurate paragraph detection, especially in documents with complex or inconsistent formatting.
For more information on related concepts, see Data Instance, Document Instance, and Value Extractor.

Tab Marking

Horizontal Tab Marker

Detects and inserts tab characters into text based on whitespace gaps, font size changes, or document layout features such as vertical lines and underlines.

Can be one of the following types:

Value	Description
Enabled
Disabled

The Horizontal Tab Marker class is used to identify locations in text where a tab character (\t) should be inserted, typically to represent columnar or tabular structure in extracted document content.

Overview

Horizontal Tab Marker analyzes the spacing between words and other layout cues to determine where tabs should be placed. It is commonly used in text preprocessing to convert visually separated columns or fields into a tab-delimited format, making downstream data extraction and parsing more reliable.

How It Works

The Horizontal Tab Marker processes the text by analyzing the gaps between words and determining where a tab character should be inserted.

The main algorithm works as follows:

The text is split into word instances.
For each pair of adjacent words, the gap between them is measured.
A set of rules is evaluated to determine if the gap qualifies for tab insertion. These rules include:
- Whitespace Gaps: If the space between two words meets or exceeds the configured 'Minimum Tab Width', it is replaced with a tab character.
- Relative to Text Height: Optionally, gaps can be evaluated as a percentage of the average character height using the 'Character Size Ratio' property.
- Font Size Changes: If the font size changes between adjacent words by more than the 'Font Size Threshold', a tab may be inserted.
- Vertical Lines: When enabled via 'Detection Options', vertical lines in the document layout can trigger tab insertion at their intersection with text.
- Underlines: When underline detection is enabled, tabs are suppressed for whitespace gaps that are underlined, supporting fill-in-the-blank scenarios.
If the gap meets any of the insertion criteria, the whitespace is replaced with a tab character.

This approach ensures that visually separated columns or fields are accurately marked with tabs, improving the reliability of downstream data extraction and parsing.
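
A minimal sketch of the whitespace-gap rule alone is shown below. It assumes each word carries left and right edge positions in inches; the `Word` class, the `mark_tabs` function, and the 0.5 inch default are hypothetical choices for illustration, and the font size, vertical line, and underline rules are left out.

```python
from dataclasses import dataclass


@dataclass
class Word:
    text: str
    left: float   # left edge of the word, in inches (assumed unit)
    right: float  # right edge of the word, in inches


def mark_tabs(words, minimum_tab_width=0.5):
    """Join the words of one line, using TAB where the gap is wide enough."""
    if not words:
        return ""
    parts = [words[0].text]
    for prev, cur in zip(words, words[1:]):
        gap = cur.left - prev.right
        parts.append("\t" if gap >= minimum_tab_width else " ")
        parts.append(cur.text)
    return "".join(parts)


row = [
    Word("Name:", 0.50, 0.95),
    Word("John", 1.05, 1.40),
    Word("Doe", 1.48, 1.75),
    Word("ID:", 4.00, 4.25),   # 2.25 inch gap before this word
    Word("12345", 4.35, 4.85),
]
marked = mark_tabs(row)
```

Only the gap before "ID:" meets the threshold, so it becomes the single tab in the output; all other gaps remain ordinary spaces.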

Configuration Guidance

Set 'Minimum Tab Width' to control the minimum gap size (in inches) that qualifies for tab insertion.
Use 'Character Size Ratio' to enable gap detection relative to text height, which is useful for documents with variable font sizes.
Adjust 'Font Size Threshold' to trigger tabs on significant font size changes, helping to separate fields with different formatting.
Use 'Detection Options' to enable or disable vertical line and underline detection as needed for your document layout.

Example 1: Field Extraction with Large Whitespace Gap

Consider a document region containing two field values separated by a large whitespace gap:

PATIENT NAME: JOHN DOE                    INTAKE DATE: 01/01/2019

When text is extracted without tab marking, the large gap is represented as a single space, making it difficult to determine where one field ends and the next begins. If you use an extractor with a pattern like PATIENT NAME: [A-Z ]+, it will overmatch and return "JOHN DOE INTAKE DATE" instead of just "JOHN DOE", because the input data looks like this:

PATIENT NAME: JOHN DOE INTAKE DATE: 01/01/2019

By enabling tab marking, the large gap is replaced with a tab character. Now an extractor looking for PATIENT NAME: [A-Z ]+ will match only JOHN DOE, because the regular expression will stop capturing when it encounters the TAB character:

PATIENT NAME: JOHN DOE\tINTAKE DATE: 01/01/2019
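
The difference can be reproduced with Python's standard `re` module. This is only a demonstration of the regular expression behavior, not of Grooper's extractor; the capture group around `[A-Z ]+` is added here so the value can be read separately from the label.

```python
import re

# The pattern from the example, with a capture group added for the value.
pattern = re.compile(r"PATIENT NAME: ([A-Z ]+)")

without_tabs = "PATIENT NAME: JOHN DOE INTAKE DATE: 01/01/2019"
with_tabs = "PATIENT NAME: JOHN DOE\tINTAKE DATE: 01/01/2019"

# [A-Z ]+ accepts spaces, so it runs on into the next label.
overmatched = pattern.search(without_tabs).group(1)

# A TAB is not in the character class, so the match stops there.
bounded = pattern.search(with_tabs).group(1)
```

With the single-space input the captured value is "JOHN DOE INTAKE DATE"; with the tab-marked input it is "JOHN DOE".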

Example 2: Table Row with Multiple Columns

Consider a table row in a document where columns are separated by large whitespace gaps:

Name               State               Age

Without tab marking, the extracted text may look like:

Name State Age

This makes it difficult to reliably extract each column value. With tab marking enabled, the output will be:

Name\tState\tAge

Now, each value is clearly separated by a tab character, making column-based extraction straightforward and robust.

Notes

Horizontal Tab Marker is typically used as part of a text preprocessing pipeline before data extraction.
Proper configuration of tab detection options is essential for accurate column and field separation, especially in documents with complex layouts.
For more information on related concepts, see Data Instance, Document Instance, and TabOptions.

Vertical Tab Marking

Vertical Tab Marker

Detects and marks large vertical whitespace gaps between lines with a vertical tab character to represent vertical separation in text.

Can be one of the following types:

Value	Description
Enabled
Disabled

The Vertical Tab Marker is a text preprocessing component that identifies significant vertical gaps between lines in a document and replaces the standard line break (CR/LF) with a vertical tab character (\v). This is useful for representing vertical structure—such as section breaks, table row separation, or logical grouping—within extracted text.

Purpose

In many documents, a large vertical gap between lines indicates a new section, a table row, or a logical break. Standard line breaks do not distinguish between normal line wrapping and these larger separations, making it difficult for downstream extractors to interpret the document's structure.

The Vertical Tab Marker solves this by converting line breaks to vertical tab characters when the vertical gap between two lines exceeds the configured threshold. This allows extractors and parsers to recognize and handle vertical structure more accurately.

How It Works

The text is split into lines.
For each pair of adjacent lines, the vertical distance between the bottom of the previous line and the top of the current line is measured.
If this gap is greater than or equal to the 'Vertical Gap Threshold', the line break is replaced with a vertical tab character (\v).
Otherwise, the standard line break is preserved.

This approach enables downstream extraction logic to distinguish between normal line wrapping and significant vertical separations, improving the accuracy of data extraction from structured documents.
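
The steps above can be sketched as follows, assuming each line carries top and bottom positions in inches. The `Line` class, the `mark_vertical_tabs` function, and the 0.25 inch default are hypothetical and exist only for this illustration.

```python
from dataclasses import dataclass


@dataclass
class Line:
    text: str
    top: float     # top edge of the line, in inches from the page top (assumed)
    bottom: float  # bottom edge of the line, in inches


def mark_vertical_tabs(lines, vertical_gap_threshold=0.25):
    """Join lines, replacing the break with \\v where the vertical gap is large."""
    out = []
    for i, line in enumerate(lines):
        out.append(line.text)
        if i + 1 < len(lines):
            gap = lines[i + 1].top - line.bottom
            out.append("\v" if gap >= vertical_gap_threshold else "\r\n")
        else:
            out.append("\r\n")  # the final line keeps a standard line break
    return "".join(out)


page = [
    Line("Section A", 1.00, 1.15),
    Line("first line", 1.20, 1.35),
    Line("second line", 1.40, 1.55),
    Line("Section B", 2.00, 2.15),  # 0.45 inch gap above this line
]
marked = mark_vertical_tabs(page)
blocks = marked.split("\v")
```

Splitting the result on \v then yields one block of text per vertically separated group, which is one way a downstream step could use the marker.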

Configuration Guidance

Set the 'Vertical Gap Threshold' property to control the minimum vertical distance (in inches, centimeters, or points) that qualifies for vertical tab insertion.
Adjust this value to match the typical spacing used for section breaks or table rows in your documents.

Usage Notes

Vertical Tab Marker is typically used as part of a text preprocessing pipeline before data extraction.
Proper configuration is essential for accurate detection of vertical structure, especially in documents with variable line spacing.
For more information on related concepts, see Data Instance and Document Instance.

Ignore Control Characters

ControlCharacters

Specifies which control characters are ignored or replaced during text preprocessing.

A combination of the following flags:

Name	Value	Description
None	0	No control characters are ignored. The document's text is left unchanged; all spaces, line breaks, and other control characters are preserved.
Space	1	Space characters are ignored. All space (' ') characters are removed from the document prior to extraction. This can be useful for extracting values where spaces are not significant or may interfere with pattern matching.
NewLine	2	Newline (\r\n) pairs are replaced or removed. Newline pairs (carriage return + line feed) are replaced by a single space if 'Space' is not ignored, or removed entirely if 'Space' is also ignored. This helps flatten multi-line values or remove unwanted line breaks.
FormFeed	4	Form feed (\f) characters are replaced or removed. Form feed characters are replaced by a space if 'Space' is not ignored, or removed if 'Space' is also ignored. This is useful for documents with page breaks or legacy formatting.
CarriageReturn	8	Carriage return (\r) characters are removed. Carriage return characters are removed, effectively converting Windows line endings (\r\n) to UNIX style (\n). This can help normalize line endings for consistent extraction.
All	255	All supported control characters are ignored. Combines all options: spaces, newlines, form feeds, and carriage returns are all removed or replaced as described above.

The Control Characters enum defines options for removing or replacing specific control characters in a document's text prior to extraction.

These options are used by the Text Preprocessor to clean up or normalize whitespace and line breaks, improving the reliability of pattern matching and data extraction.

Multiple values can be combined to ignore several types of control characters at once.
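
How the flags combine can be sketched with plain integer constants that mirror the values in the table. The function `ignore_control_characters` and the order in which it applies the flags are assumptions made for this illustration; only the flag values and the replace-or-remove behavior come from the table above.

```python
# Flag values as listed in the table above.
NONE, SPACE, NEW_LINE, FORM_FEED, CARRIAGE_RETURN, ALL = 0, 1, 2, 4, 8, 255


def ignore_control_characters(text, flags):
    """Apply the selected flags to text (illustrative ordering)."""
    # Newlines and form feeds become a space unless spaces are also ignored.
    filler = "" if flags & SPACE else " "
    if flags & NEW_LINE:
        text = text.replace("\r\n", filler)
    if flags & FORM_FEED:
        text = text.replace("\f", filler)
    if flags & CARRIAGE_RETURN:
        text = text.replace("\r", "")
    if flags & SPACE:
        text = text.replace(" ", "")
    return text


source = "INV\r\n123 456\fEND"

unchanged = ignore_control_characters(source, NONE)
flattened = ignore_control_characters(source, NEW_LINE)
compact = ignore_control_characters(source, NEW_LINE | SPACE)
unix_style = ignore_control_characters(source, CARRIAGE_RETURN)
stripped = ignore_control_characters(source, ALL)
```

Combining NewLine with Space removes the line break entirely instead of turning it into a space, which matches the table's description of how the two flags interact.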

Used By

Field Match Label Match Pattern-Based Flow List Match Pattern Match Word Match Labeled Region Extracted Semantic Ask AI
