- Overview
- Help Status
-
Activity
- Attended Activity
-
Code Activity
- Apply Rules
- Attach
- Batch Transfer
- Burst Book
- Classify
- Clip Frames
- Convert Data
- Correct
- Deduplicate
- Detect Frames
- Detect Language
- Detect Language (Legacy)
- Dispose Batch
- Execute
- Export
- Extract
- Fill Data
- GPT Embed
- Image Processing
- Initialize Card
- Launch Process
- Mark Attachments
- Merge
- Recognize
- Redact
- Remove Level
- Render
- Route
- Send Mail
- Separate
- Spawn Batch
- Split Pages
- Split Text
- Text Transform
- Train Lexicon
- Translate
- XML Transform
-
Article
- AI Assistants
- AI Powered Features
- Batch Processing Workflows
- Document Scanning
- PDF Processing
- Search Overview
- Attachment Type
- Behavior
- Capture Device
- Classify Method
- CMIS Binding
- CMISQL Element
- Collation Provider
-
Command
- Action List - Create Copy Actions
- AI Chat
- Attachment Type
- Batch
-
Batch Folder
- Batch Folder - Add To Index
- Batch Folder - Assign Document Type
- Batch Folder - Classify Command
- Batch Folder - Collapse
- Batch Folder - Edit Type Assignment
- Batch Folder - Extract
- Batch Folder - Group Children
- Batch Folder - Insert Control Sheets
- Batch Folder - New Text Document
- Batch Folder - Remove From Index
- Batch Folder - Remove Level
- Batch Folder - Revert To Loose Pages
- Batch Folder - Set Field Value
- Batch Folder - Sort Children
- Batch Object
- Batch Page
- Batch Process
- CMIS Connection
- CMIS Document Link
- CMIS Export Map - Auto Map
- CMIS Folder Link
- CMIS Import Map - Auto Map
- CMIS Repository - Reset
- CMIS Type Definition - Generate Local Type
- Column Map - Auto Map
- Content Link - Remove Link
-
Content Type
- Content Type - Clean Overrides
- Content Type - Create Data Model
- Content Type - Create Local Resources Folder
- Content Type - Create Search Index
- Content Type - Delete Search Index
- Content Type - Generate Control Sheets
- Content Type - Purge Training
- Content Type - Rebuild Training
- Content Type - Submit Indexing Job
- Convert Data - Create Convert Actions
- Copy Base
- Data Connection
- Data Element - Remove Overrides
- Data Field Container
- Data Type - Convert To Value Reader
- EDI File
- Excel Document - Convert to CSV
- Exchange - Rebuild Search Folder
- Field Class - Purge Training
- File Store
- File System Link
- Folder - Remove Empty Subfolders
- FTP Link
- HTML Document
- HTTP Link
- JSON File
- Lexicon
- Machine - Tune File System
- Mail Link
- Mail Message
- Node
- OAuth Client Credentials - Test
- Object Library
- PDF Document
- Project - Remove Empty Subfolders
- PST File - Burst
- PST Link - Load Content
- Resource File
- Root
- Saved Query
- Search Index - Generate Subsets
- SFTP Link
- Text Document
- Value Reader - Convert To Data Type
- vCard - Expand Photo
- Word Document - Convert to PDF
- XML File
- ZIP Archive
- ZIP Link - Load Content
- Content Link
- Data Action
- Data Instance
- Export Definition
- Export Format
- Grooper Command Console (GCC)
- Import Definition
- Import Provider
-
IP Command
- Adjust Saturation
- Adjust Tint
- Analyze Photo
- Auto Adjust Levels
- Auto Color Balance
- Auto Convert
- Auto Deskew
- Auto Orient
- Auto QA
- Auto White Balance
- Barcode Detection
- Binarize
- Blank Page Detection
- Border Detect
- Box Detection
- Brightness Contrast
- Classify Image
- Color Detection
- Color Dropout
- Color Stamp Detection
- Colorize
- Compute Moments
- Contrast Stretch
- Convert
- Corner Detection
- Crop
- Dilate Erode
- Edge Detection
- Execute Profile
- Extract Channel
- Extract Features
- Extract Page
- Feature Dropout
- Filter
- Gamma Adjust
- Histogram
- Hough Lines
- Invert
- Line Detection
- Measure Entropy
- Mirror
- Negative Region Removal
- OCR Cleanup
- Patch Code Detection
- Posterize
- Projection Profile
- Randomize Defects
- Resize
- Rotate
- Shade Removal
- Shape Detection
- Solarize
- Sticky Note Detection
- Swap Channels
- Undistort
- Warp
- Lookup Specification
- Measurement
-
Node
- AI Assistant
- Batch Object
- Batch Process
- Batch Process Step
- CMIS Connection
- CMIS Repository
- Content Type
- Control Sheet
- Data Connection
- Data Element
- Data Rule
- Extractor Node
- File Store
- Folder
- IP Element
- Lexicon
- Machine
- Object Library
- OCR Profile
- Project
- Resource File
- Root
- Scanner Profile
- Separation Profile
- Training Page
- Work Queue
- OCR Engine
-
Property Converter
- Auto Deskew - Precision Converter
- Base Multi Culture Converter
- Blank Zero Converter
- Check List Converter
- CMIS Export Map - Column Converter
- CMIS Folder Reference - Converter
- CMIS Import Map - Field Converter
- CMIS Object - Choice Converter
- Code Expression - Converter
- Collection Converter
- Column Map - Column Converter
- Content Type - Unlimited Converter
- Degrees Converter
- Execute Command - Link Name Converter
-
Expandable Converter
- Base Culture Converter
- Batch Name Settings - Converter
- Border - Converter
-
Choice Converter
- Activity Processing - Queue Converter
- AI Chat Filter - Index Converter
- AI Search - Api Version Converter
- Apply Image Cleanup - Ip Profile Converter
- Azure OCR - Api Version Converter
- Azure OCR - Model Version Converter
- Barcode Extractor - Output Group Converter
- Base Combining Provider - Group Name Converter
- Batch - Step Converter
- Batch Process - Queue Converter
- Batch Process Step - Processing Scope Converter
- Batch Process Step - Queue Converter
- Batch Transfer - Process Converter
- Batch Transfer - Repository Converter
- Batch Transfer - Step Converter
- Build Fine Tuning File - Fill Method Converter
- Chat Filter - User Id Converter
- Chat Filter - User Name Converter
- Classify - Classification Level Converter
- Classify - Output Level Converter
- CMIS Export - Creatable Child Type Converter
- CMIS Export - Creatable Folder Converter
- CMIS Type Reference - Cmis Type Converter
- CMISQL Element - Qrderable Property Converter
- CMISQL Element - Queryable Property Converter
- CMISQL Element - Selectable Property Converter
- CMISQL Query - Primary Type Converter
- ColorTrac Scanner - Resolution Converter
- Comparison Filter - Function Name Converter
- Comparison Filter - Operand Type Converter
- Comparison Filter - Value Type Converter
- Comparison Predicate - Comp Op Converter
- Comparison Predicate - Value Converter
- Data Element - Display Label Converter
- Data Field - Sub Element Converter
- Database Table - Extended Property Name Converter
- Database Table - Table Name Converter
- EDI Schema Importer - X12Schema Converter
- Fill - Fill Method Converter
- Fill Data - Name Converter
- Fill Descendants - Name Converter
- Flag Item - Flag Reason Converter
- Generate Local Type - Doc Type Property Converter
- Import Provider - Disposition Converter
- Import Repository - Repository Converter
- ISIS Device - Device Name Converter
- Join Clause - Secondary Type Converter
- Label Info - Parent Label Converter
- Lexicon Lookup - Lookup Field Converter
- Lexicon Lookup - Target Field Converter
- Nested Table - Table Converter
- ODBC - Pg Odbc Dsn Converter
- Pattern-Based - Group Name Converter
- PDF Data Mapping - Font Name Converter
- Predicate List - Logical Operator Converter
- Read Metadata - Property Name Converter
- Reference - Group Name Converter
- Regular Expression - Group Converter
- Remove From Index - Index Name Converter
- Remove Overrides - Property Name Converter
- Reset - Step Converter
- Root - License Url Converter
- Route Definition - Process Converter
- Run Step - Step Converter
- Schema Mapping - Schema Name Converter
- Search Index - Index Name Converter
- Search Index Query - Index Name Converter
- Send To Test Batch - Flag Reason Converter
- Set Field Value - Value Converter
- String - Pdf Font Name Converter
- Task Filter - Activity Name Converter
- Task Filter - Process Name Converter
- Task Filter - Queue Converter
- Task Filter - Step Name Converter
- Text Document - Encoding Converter
- Text Document - Normalize Encoding Converter
- TWAIN Device - Compression Mode Converter
- TWAIN Device - Device Name Converter
- Update Process - Process Converter
- Update Process - Step Converter
- Value Selector - Target Field Converter
- XML Value Selector - Target Field Converter
- Double Range - Double Range Converter
- Expandable Info Converter
- Integer Range - Integer Range Converter
- Logical Border - Arrow Converter
- Logical Border - Logical Border Converter
- Logical Point - Logical Point Converter
- Logical Rectangle - Logical Rectangle Converter
- Logical Size - Logical Size Converter
- On Off Converter
- Percent Range - Percent Range Converter
- Point ExF - Converter
- Rectangle - Converter
-
Type Selector
- CMISQL Query - Where Element Converter
- Data Connection - Connection Converter
- Exchange - Auth Method Converter
- Execute - Command Converter
- Execute Activity - Activity Converter
- Execute Command - Command Converter
- Run Activity - Activity Converter
- SharePoint - Auth Method Converter
- Storage Type - Converter
- Web Service Lookup - Auth Method Converter
- Unit Border - Unit Border Converter
- Unit Line Length - Unit Line Length Converter
- Unit Point - Unit Point Converter
- Unit Range - Unit Range Converter
- Unit Rectangle - Converter
- Unit Size - Unit Size Converter
- Value Extractor - Converter
- JPEG 2000 - Ratio Converter
- Logical Value - Simple Value Converter
- Logical Value - Universal Value Converter
- Node Information - Value Converter
- Page Filter Converter
- Path Expression - Converter
- Percent Converter
- Pg Dictionary Converter
- Pg Flags Converter
- Pg Ref Collection Converter
- Pg String Collection Converter
- Pg Type Display Name Converter
- Product License - Quantity Converter
- Read Only Converter
- Rectangle - Inches Converter
-
Simple Converter
- Click to Edit Converter
- Data Action - Source Element Converter
- Data Action - Target Element Converter
- IN Predicate - In Predicate Values Converter
- OAuth Authentication - Login Converter
- Pattern Match - Group Options Converter
- Pg Format Converter
- Product License - Quantity Used Converter
- Project - Projects Converter
- Publish To Repository - Repository Converter
- Result Set Options - Sort Order Converter
- Review - View List Converter
- Stats Query - Name List Converter
- String - Pg Text Lines Converter
- Type Permissions - Command Converter
- Word Match - Term Options Converter
- Text Rendering - Size Converter
- Time Range Converter
- Time Ranges Converter
- Time Span Converter HMS
- Timer Service - Time Converter
- Times Converter
- Unit Value - Unit Value Converter
- Word Match - Integer Range Converter
-
Property Editor
- Anchor Definition - Location Editor
- Barcode Detected - Preview Image Editor
-
Choice Property Editor
- Azure Document Intelligence OCR - Model Editor
- Base Culture Editor
-
Check List Editor
- AI Assistant - Search Index Editor
- AI Table Reader - Included Columns Editor
- Batch Filter - Activity Editor
- Batch Filter - Process Editor
- Batch Filter - Status Editor
- Batch Filter - Step Editor
- CMIS Type Reference - Secondary Types Editor
- Data Fill Method - Included Children Editor
- Delete Fine Tuned Model - Models Editor
- Generate Local Type - Property Check List
- IMAP - folder Editor
- Publish To Repository - Repository Editor
- Rebuild Indexes - Table Names Editor
- Reset - Step Checklist Editor
- Stats Query - Name List Editor
- Table Mapping - Column Check List
- Text Analysis - Entity Type Editor
- Type Permissions - Command Editor
- Data Connection - Table Name Editor
- Delete Fine Tuned Model - Model Editor
- GPT Embed - Embeddings Model Editor
- LLM Connector - Chat Model Editor
- LLM Connector - Embeddings Model Editor
- Return Value - Column Editor
- SQL Server - Database Name Editor
- Start Fine Tuning Job - Model Editor
- CMIS Compound Type - Editor
- CMISQL Query - Query Editor
-
Code Property Editor
- AI Chat Filter - Filter Editor
- AI Schema Extractor - Schema Editor
- Ask AI - Schema Editor
-
Code Expression Editor
- Batch Process Step - Next Step Editor
- Batch Process Step - Should Submit Editor
- Calculate Value - Value Expression Editor
- CMIS Export Map - Expression Editor
- CMIS Import Map - Expression Editor
- Code Expression - Editor
- Column Map - Expression Editor
- Computed Field - Expression Editor
- Concat - Trigger Editor
- Content Type - Caption Editor
- Copy Base - Trigger Editor
- Custom Statement - Statement Editor
- Data Export - Alternate Database Editor
- Data Field - Default Value Editor
- Data Field - Field Expression Editor
- Data Rule - Trigger Editor
- Data Section - Caption Editor
- Expression Set - Default Value Editor
- Expression Set - Field Expression Editor
- IP Element - Next Step Editor
- IP Element - Should Execute Editor
- Lookup Specification - Trigger Editor
- Metadata Options - Value Editor
- Path Expression - Editor
- Raise Issue - Log Message Editor
- Remove - Trigger Editor
- Require Value - Log Message Editor
- Text Transform - Record Editor
- Variable Definition - Expression Editor
- Create Table - Statement Editor
- Data Field Container - Css Editor
- Database Lookup - SQL Query Editor
- Embedded Lexicon - Local Entries Editor
- Json Property Editor
- KVP Editor
- Lexicon - Lexicon Link Code Editor
- List Match - Local Entries Editor
- Mail Import - IMAP Query Editor
- Pattern Match - Output Format Editor
- Regex Property Editor
- Search Classifier - Filter Editor
- Search Index - Filter Editor
- Search Index Query - Filter Editor
- Search Index Query - Order By Editor
- Search Index Query - Search Editor
- Send Mail - Template Editor
- String List Editor
- Submit Indexing Job - Select Editor
- Subset Filter - Filter Editor
- Text Property Editor
- Web Service - Header Editor
- Web Service Lookup - Post Data Editor
- Web Service Lookup - Url Editor
- Word Match - Output Format Editor
- XML Lookup - Selector Editor
- XML Transform - Transform Editor
- XML Value Selector - Path Editor
- Folder Browse Editor
- LDAP - ACL Editor
- OAuth Authentication - Login Editor
- Object Collection Editor
- Object Properties Editor
- Open File Editor
-
Reference Editor Base
-
Node Reference Editor
- Archive - Folder Editor
- Batch Process Step Editor
- Content Type Editor
- Custom Statement - Scope Editor
- Data Action - Action Element Editor
- Data Action - Source Editor
- Data Action - Target Editor
- Data Field Container - Rule Editor
- Data Rule - Scope Editor
- Database Cleanup - Folder Editor
- Dispose Batch - Target Folder Editor
- Execute Rule - Rule Editor
- Field Match - Field Editor
- Generate Subsets - Field Editor
- Grid Layout - Header Column Editor
- Piece Info Options - Key Column Editor
- Piece Info Options - Value Column Editor
- Return Value - Field Editor
- Set Field Value - Field Editor
- Table Mapping - Scope Editor
- Task Filter - Batch Editor
- Test Batch Editor
- Text Transform - Scope Editor
- Train Lexicon - Scope Editor
- Virtual Table Definition - Collection Editor
- Web Service - Definition File Editor
- Ordered Reference Editor
-
Reference List Editor
- All Nodes Reference Editor
- Behavior - Field List Editor
- Bookmark Options - Data Element Editor
- Build Fine Tuning File - Batch Editor
- Content Types Editor
- Correct - Fields Editor
- Data Fill Method - Included Descendants Editor
- Data Model - Style Sheets Editor
- Data Rule - Required Elements Editor
- Data Values - Included Elements Editor
- Extract - Data Element Filter Editor
- Indexing Behavior - Included Elements Editor
- JSON Data Mapping - Included Elements Editor
- Lexicon - Lexicons Editor
- Piece Info Options - Element Editor
- Project - Projects Editor
- Redact - Extractors Editor
- Redact - Fields Editor
- Require Value - Required Elements Editor
- Section Extract Method - Included Descendants Editor
- Thumbnail View - IP Profiles Editor
- Transaction Detection - Field List Editor
-
Node Reference Editor
- Sample Image Collection - Editor
- Value Extractor - Editor
- Zone Editor
- Schema Importer
- Section Extract Method
- Separation Provider
- Service Instance
- Storage Type
- Table Extract Method
- Task View
-
UI Element
-
Control
- Active Task List
- AI Helper
- Batch Info Tab
- Batch Info Viewer
- Batch List
- Batch Manager
- Candidate List
- Card List
- Chat Console
- Class Help
- CMIS Repository Searcher
- CMIS Tree Browser
- CMIS Type Tree
- Code Editor
- Complete List
- Content Viewer
- Context Menu
- Conversation Viewer
- Data Element Tester
- Data Grid
- Data Grid Document
- Data Grid Element
- Data Inspector
- Data Tree
-
Design Tab
- AI Assistant - Chat History
- Batch
- Batch Folder - General
- Batch Page - General
- Batch Process
- Batch Process Step
- CMIS Connection - General
- CMIS Repository
- Content Type
- Control Sheet - General
- Data Connection - General
- Data Element
- Data Rule - Tester
- Extractor Node - Tester
- Field Class - Weightings
- Folder - Batches
- IP Element Container - Tester
- IP Step - Tester
- Lexicon - General
- Machines
- Node
- OCR Profile - Tester
- Processing Queue - Workers
- Project - Usage
- Resource File - General
- Root
- Training Page - General
- Design Tab Host
- Diagnostics Viewer
- Document Searcher
- Document Viewer
- Expression Grid
- Extractor Builder
- FRX Grid
- FRX Visualizer
- Image Editor
- Image Print Preview
- Image Viewer
- Instance Searcher
- Label Set Editor
- List Searcher
- Lookup Fields
- Lookup Results
- Node Finder
- Node Report
- Object List
- OCR Viewer
- Page Navigator
- Profile Browser
- Property Grid
-
Property Grid Editor
- ACL Editor
- Anchor Editor
- Choice Editor
- CMIS Query Editor
- Code Property Editor
- Collation Editor
- Collection Editor
- Extractor Property Editor
- Folder Editor
- List Editor
- Multi Reference Editor
- OAuth Log-in Editor
- Object Editor
- Ordered Reference Editor
- Preview Image Editor
- Reference Editor
- Sample Image Editor
- Zone Editor
- Property Help
- Query Editor
- Query Helper
- Query List
- Recognition Tester
- Rep Info Panel
- Review Tab
- Search Result Cards
- Separation List
- Service Collection
- Splitter
- Stats Report
- Stats Result Set
- Stats Viewer
- Tab List
- Task List
- Test Source
- Tree Viewer
- Upload Dialog
- Weightings List
- Web Page
-
Control
- Value Extractor
- Variable Provider
-
Other Configuration Types
- API Key
- Archive Info
- Border
- Capture Settings
- Character Class Filter
- Chat Parameters
- CMIS Object
- CMIS Property Definition
- Code39Settings
-
Connected Object
- Batch Filter
- Chat Filter
- Database Row
-
Embedded Object
- AI Chat Filter
- AI Chat Settings
- AI Generator
- Anchor Definition
- Attachment Rule
- Auto Complete Settings
- Barcode Reader
- Batch Creation Settings
- Batch Name Settings
- Bookmark Options
- Bot Connector
- Boundary Detector
- Chunk Settings
- Cluster Parameters
- CMIS Export Map
- CMIS Folder Reference
- CMIS Import Map
- CMIS Type Definition
- CMIS Type Reference
- Code Expression
- Column Map
- Command Options
- Computed Field
- Content Mapping
- Custom Statement
- Data Element Extension
- Data Element Profile
- Data Fill Method
- Data Generator
- Edge Adjustment
- Embedded Lexicon
- Environment Options
- Execute Step
- Expression Set
- Field Annotation
- Field Mapping
- File Reference
- Folder Level Info
- FRX Options
- FTP Repository Configuration
- Fuzzy Lookup Options
- Horizontal Tab Marker
- HTTP Auth Method
- HTTP Resource
- Hyperlink Selector
- Image Segmentation Options
- Import Schedule
- Index Stats
- Label Info
- Label Set
- Label Version
- Layout Provider
- Line Periodicity Detector
- LLM Provider
- Lucene Query
- Metadata Options
- Multiline Row Settings
- OCR Layer
- OCR Repair Options
- OMR Box
- Page Attachment Rule
- Paragraph Marker
- Path Expression
- PDF Expand Method
- Permission Set
- Piece Info Options
- Quoting Method
- Region Definition
- Repository Configuration
- Repository Option
- Resource Reference
- Result Filter
- Result Processor
- Result Set Options
- Return Value
- Route Definition
- Sample Image Collection
- Schema Mapping
- Search Filter
- Separate Action
- Service Deployment
- Service Stats
- Stats Query
- Subset Filter
- Table Header Detector
- Table Mapping
- Table Row Detector
- Text Preprocessor
- Transaction Extractor
- Type Permissions
- Value Lookup
- Value Selector
- Variable Definition
- Vector Search Options
- Vertical Tab Marker
- Virtual Table Definition
- XML Value Selector
- Node Query
- Purge Folder
- Search Index Query
- Task Filter
- Constrained Wrap Options
- Culture Data
- Dash Detector
- Database Connection Settings
- Defect Generator
- Double Range
- Dropout Method
- Event Filter
- Fiche Card Layout
- Folder Level Options
- Horizontal Alignment Settings
- HTTP Authentication Method
- Image Compression
- Image Info
- Integer Range
- Line Snap Options
- Margin Detector
- Multi Line Settings
- Node Information
- PDF Burst Settings
- PDF Page Generator
- PDF Render Settings
- Percent Range
- Rectangle
- Region Detector
- Regular Expression
- Remote Repository
- Row Alignment Settings
- Scan Once Settings
- Semantic Quoting Query
- SFTP Repository Configuration
- Shell Execute Info
- Sort Specification
- System Config
- Text Wrap Options
- TIFF Page
- Transaction Layout Detection
- Vertical Wrap Detection
- Advanced Topics
-
Enumerations
-
Grooper
- CharacterCasing
- ConcurrencyMode
- DatabaseStatus
- DCTModes
- EventType
- NodeAttributes
- Pages
- PixelFormat
- ProcessingScope
- ProcessingStatus
- ResultOrder
- SimplePixelFormat
-
Grooper.Activities
- ActionType
- BatchDisposition
- BatchNameSuffixEnum
- BodyRenderingMethod
- ComparisonMode
- DuplicateDisposition
- ExecuteType
- ExecutionScope
- ExtractMode
- FilterType
- MatchActions
- OcrAssistMode
- PageExtractMode
- ProblemDisposition
- ReclassifyModes
- RepairScope
- RouteMethod
- SaveDisposition
- SharedBehaviorModes
- SpawnMethod
- StatsLoggingMode
- TextExtractMode
- TrainingScope
- XmlSource
- XmlTarget
-
Grooper.Capture
- FeedOrientation
- ImportType
- MissDispositionEnum
- PageDirection
- ScanningSpeed
- TwainCompressionModes
-
Grooper.Capture.ColorTrac
- ColorFormat
- PageSizeMode
- PaperEndCondition
- PaperJustification
- ScanSpeed
- StandardPageSize
-
Grooper.Cloud
- ApiRegionEnum
- ContentLayout
- HttpVerbs
- MessageFormats
- MetadataModes
- TranslateDisposition
-
Grooper.CMIS
- AuthenticationProvider
- CmisProtocol
- ImportModes
- LoadScope
- NamingMethods
- OrderByDirection
- TransferScopes
-
Grooper.Core
- ActivateModes
- ArrayActions
- AttachmentPosition
- BrowserSuggestMode
- CalculateModes
- CalculateModes
- CaptureScope
- ClassificationLevel
- CompareMode
- ConflictDispositions
- ConflictResolution
- ControlCharacters
- CreateModes
- DedupMode
- DispositionType
- DuplicateFilenameResolution
- FolderRelativePosition
- FolderRelativePosition
- FooterModes
- FormatOptions
- FuzzyMatchMode
- GroupingColumn
- IdfModes
- IssueDisposition
- JsonLayout
- LexiconType
- MergeModes
- MissDispositions
- MissDispositions
- NumberFormats
- OxiElement
- PaginationType
- ParagraphOptions
- PdfBuildOptions
- PopulationMethod
- ProcessingLevel
- PropagationMode
- SegmentType
- SortColumns
- SortDirection
- SortDirections
- SortDirections
- SortOption
- SortOrder
- StandardWeightings
- TabOptions
- TaskScope
- TfModes
- TimeFrames
- TimeGrouping
- TrainingScopes
- TriggerModes
- TypeKind
- TypeModes
- TypeOperation
- UserTrainingMode
- ValueInterpretations
- ZIPDispositions
-
Grooper.EDI
- AttachmentNamingMethods
- DataDisposition
- NamingMethods
- NamingMethods
-
Grooper.Extract
- AdjustmentMethod
- AlignmentMode
- CollationType
- CombineType
- CompassDirection
- ConfidenceModes
- ContextScopes
- CultureScopes
- ExecutionScope
- FlowDirection
- GroupingType
- HorizontalDataAlignment
- HorizontalDataAlignment
- LabelLayout
- LookupOption
- MappingType
- OmrBoxDirection
- OmrFlowDirection
- OmrMode
- OutputValueOptions
- ReadDirection
- ReadMethods
- ReferencePointPosition
- ROIModes
- RowDetectionMode
- RowMatchOptions
- SecondaryExtractMethod
- SecondaryExtractTrigger
- SplitPositionEnum
- TableRowAlignment
- TableStyles
- VerticalDataAlignment
- WordTransform
-
Grooper.GPT
- AuthorizationMethod
- BooleanOperator
- BuiltInFieldKinds
- DocumentLinkingOptions
- FieldAlignMode
- IndexOperations
- LambdaFunction
- LayoutComponentTypes
- OperationType
- QueryTypes
- ReasoningEffortLevels
- ResultOrder
- RetrievalOptions
- RowAlignMode
- SearchModes
- SectionAlignMode
- ServiceTiers
- VerbosityLevels
-
Grooper.IP
- AdaptiveKernelType
- AngleCategory
- Axis
- BinarizationMethod
- ChannelNumber
- Code39Options
- ColorSpaceType
- CombDetectionType
- CompressionMode
- Connectivity
- CropMethod
- CurveType
- DetectMethod
- FeatureType
- FillMethod
- FilteringLevel
- FilterTypeEnum
- HarrisFilterType
- HoughLevel
- ImageEdges
- InpaintMethod
- MaskShape
- MaskSize
- MeasurementType
- Method
- OneDimSymbology
- OperationType
- OperationType
- Pdf417Options
- PostSymbology
- ProcessingResolution
- ProgressionOrder
- ReadDirection
- ReadingQuality
- ResizeInterpolationMode
- SizeMethod
- Symbology
- TwoDimSymbology
- WarpInterpolationMode
-
Grooper.Messaging
- BodyHandling
- Orientation
- PaperKind
- SaveAction
- SelectorKind
-
Grooper.OCR
- AccuracyLevels
- BaseCharacterSetEnum
- DetectionMethod
- EngineModeEnum
- FontPitchMode
- LexMode
- PageOrientation
- PageOrientation
- SegmentationModeEnum
- SynthesisMethodEnum
-
Grooper.Office
- SaveMethod
-
Grooper.PDF
- CompressionMode
- ImageLayout
- PDFAComplianceLevels
- PdfBorderStyle
- PdfDisplayMode
- PdfPermissions
- PdfViewerOptions
- SearchableTextFormat
- TargetColorFormat
-
Grooper.Services
- DaysOfWeek
-
Grooper.Services.CMIS
- ConnectMethod
- ContentMode
- FileType
- FormOverlayType
- MergeAction
-
Miscellaneous
- BaseTypeId
- CharacterCasing
- CompressionLevel
- ContentAlignment
- DateTimeStyles
- FileAttributes
- FontStyle
- Formatting
- HorizontalAlignment
- Keys
- NumberStyles
- RegexOptions
- ThreadPriority
- UriKind
-
Grooper
Word Match
Extracts individual words and multi-word phrases (N-grams) from document text for use in classification, data extraction, and normalization.
Remarks
Word Match is designed to locate and output single words or contiguous multi-word phrases from document text. It is a foundational tool for both data extraction and document classification in Grooper, enabling the identification of context-rich features such as names, titles, and key phrases.
What It's For
The primary purpose of Word Match is to break down document text into meaningful units—words and phrases—that can be used as features for classification, or as extracted values for data fields. By capturing not just individual words, but also sequences of words (N-grams), Word Match provides richer context for downstream processes. For example, the phrase "remittance advice" is more informative than the words "remittance" and "advice" considered separately.
Typical use cases include:
- Extracting person names, organization names, or other multi-word entities.
- Generating features for machine learning-based classification.
- Normalizing and validating extracted values against vocabularies or lists.
- Supporting advanced scenarios such as correcting OCR errors or handling multilingual documents.
How It Works
Word Match operates in two main steps:
- Word Identification:
The extractor scans the document text and identifies words using a regular expression. This allows for language-specific, length-specific, or error-tolerant matching. - Phrase Assembly:
Adjacent words are grouped into phrases (N-grams) of configurable length. All possible contiguous N-grams are produced, subject to join rules and optional lookups or validation.
The output consists of all valid words and phrases found in the text, ready for use in classification, extraction, or normalization workflows. Phrase extraction is especially valuable for scenarios where context matters, such as distinguishing between "John Doe" and "Doe John", or identifying key phrases for document categorization.
Integration and Advanced Scenarios
Word Match integrates seamlessly with Grooper's extraction and classification system. It supports:
- Validation and normalization of extracted values using vocabularies and lookups.
- Custom output formatting for phrases.
- Handling of OCR errors and multilingual content.
- Use in both field extraction and classification activities.
Diagnostics
When diagnostic logging is enabled, Word Match produces artifacts that help validate and troubleshoot extraction:
- Logs of matched words and phrases.
- Timing and performance metrics for extraction steps.
- Summaries of results and any validation issues.
Properties
| Name | Type | Description | |||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Matching | |||||||||||||||||||||||
| Word Pattern | String | ► |
A regular expression pattern used to identify individual words in document text. The 'Word Pattern' property defines the rule for what constitutes a "word" during extraction. This pattern is applied to the document text to locate sequences of characters that should be treated as words for further processing and phrase assembly. How It Works
Impact
Examples
Usage Scenarios
> Use diagnostics to review which words were matched and to tune the pattern for your data. |
||||||||||||||||||||
| Prefix Pattern | String | ► |
Defines an optional prefix which must occur immediately before each match. The 'Prefix Pattern' property allows you to specify a regular expression that must be present immediately before each match found by the Text Match extractor. This enables context-sensitive extraction, ensuring that only values preceded by a specific pattern, label, or structural element are returned. PurposeUse this property to restrict matches to those that occur after a particular label, whitespace, line break, or other context. This is especially useful for extracting labeled values, enforcing boundaries, or avoiding false positives in complex documents. Configuration Guidance
Examples
Impact
Usage Scenarios
> Use diagnostics to review which prefix patterns were applied and to troubleshoot extraction boundaries. |
||||||||||||||||||||
| Suffix Pattern | String | ► |
Defines an optional suffix which must occur immediately after each match. The 'Suffix Pattern' property allows you to specify a regular expression that must be present immediately after each match found by the Text Match extractor. This enables context-sensitive extraction, ensuring that only values followed by a specific pattern, label, or structural element are returned. PurposeUse this property to restrict matches to those that occur before a particular label, whitespace, line break, or other context. This is especially useful for extracting values with trailing units, enforcing boundaries, or avoiding false positives in complex documents. Configuration Guidance
Examples
Impact
Usage Scenarios
> Use diagnostics to review which suffix patterns were applied and to troubleshoot extraction boundaries. |
||||||||||||||||||||
| Environment | Environment Options | ► |
Provides configuration for merge variables and culture settings used by regex-based extractors. OverviewThe Environment Options class controls how merge variables are resolved and how culture information is applied during extraction. Merge variables are referenced in regular expressions using the syntax @VariableName. These variables can represent lists of values, culture-specific data, or reusable regex fragments. Built-In Merge VariablesSeveral built-in merge variables are available (see Variable Providers), exposing culture-specific lists such as @DayNames, @MonthNames, and @CurrencySymbols. For example, the @DayNamesAbbreviated variable expands to a list of day abbreviations appropriate for the current culture:
These variables adapt automatically to the culture of the document or to a specified override. Custom Merge VariablesYou can define custom merge variables by referencing Lexicons that contain the desired values or key-value pairs. Custom variables are useful for:
To define custom variables, use the 'Value Lists' and 'Snippet Libraries' properties to reference appropriate Lexicons. Each lexicon entry becomes a merge variable, which can be injected into your regular expressions using the @VariableName syntax. Culture SettingsThe culture used for variable expansion can be controlled via the 'Culture Override' and 'Culture Scope' properties. By default, variables are generated using the culture of the input document, but you can force a specific culture or restrict processing to certain languages using these options. This is especially useful when extracting data from mixed-language documents, or when you need to standardize extraction behavior across different locales. Usage Guidance
|
||||||||||||||||||||
| Options | |||||||||||||||||||||||
| Case Sensitive | Boolean | ► |
Specifies whether matching should be performed in a case-sensitive manner.
False
The 'Case Sensitive' property controls whether the regular expression pattern, prefix, and suffix matching performed by the Text Match extractor will distinguish between uppercase and lowercase letters. PurposeEnable this property when the capitalization of text is meaningful for your extraction scenario, such as distinguishing between proper names, acronyms, or case-specific labels. Configuration Guidance
Impact
Examples
> Use diagnostics to verify which matches were found and to troubleshoot case-related extraction issues. |
||||||||||||||||||||
| Preprocessing | Text Preprocessor | ► |
Applies configurable text preprocessing to a document's content before regular expression extraction. The Text Preprocessor enables advanced manipulation of control characters in a document's text, allowing regular expressions to match or ignore structural elements such as line breaks, paragraph boundaries, page breaks, tabs, and spaces. OverviewText preprocessing is performed immediately before extraction, transforming the document's text to improve the accuracy and flexibility of pattern matching. This is especially useful when data values span multiple lines, are separated by large whitespace gaps, or are affected by inconsistent formatting. Key Features
Usage Guidance
Example Scenarios
For more details, see the documentation for Paragraph Marker, Horizontal Tab Marker, and Vertical Tab Marker. Examples1. Sample DocumentConsider the following sample document.
2. Default Control CharactersWith no preprocessing options enabled, the document data will look like this. Whitespace gaps, no matter how large,
are represented by a single space character. A
3. Preprocessed VersionPreprocessing the document with paragraph marking and tab marking will place a tab character '\t' at each large whitespace gap, and replace newline pairs '\r\n' occuring inside a paragraph with a space.
|
||||||||||||||||||||
| Word Lookup | Value Lookup | ► |
An optional lookup used to validate, normalize, or correct individual words after extraction. The 'Word Lookup' property allows you to apply a vocabulary, exclusion list, or fuzzy matching logic to each word identified by the extractor. This is useful for enforcing allowed values, correcting OCR or spelling errors, and mapping words to standardized forms. How It Works
Impact
ExamplesUsage Scenarios
> Use diagnostics to review which words were validated, corrected, or excluded by the lookup. |
||||||||||||||||||||
| Chunk Size | Int32 | ► |
The chunk size, in pages, to use when processing large documents.
Blank Zero Converter
1000
The 'Chunk Size' property enables chunked processing for large documents, allowing the Text Match extractor to break the document into smaller segments and process each chunk separately. PurposeUse this property to optimize extraction performance and memory usage when working with documents containing many pages (hundreds or thousands). Configuration Guidance
Impact
Usage Scenarios
> Use diagnostics to review chunk boundaries, extraction times, and the number of results produced per chunk. |
||||||||||||||||||||
| Phrases | |||||||||||||||||||||||
| Phrase Size | Int32 | ► |
Specifies the number of words to include in each extracted phrase (N-gram). Can be one of the following types:
The 'Phrase Size' property determines how many adjacent words are grouped together to form a phrase, also known as an N-gram. This enables the extractor to capture not only single words, but also meaningful multi-word combinations that provide richer context for classification and data extraction. How It Works
Impact
ExamplesFor the text 'the quick brown fox':
Usage Scenarios
> Use diagnostics to review the number and type of phrases produced, and adjust phrase size for your scenario. |
||||||||||||||||||||
| Join Pattern | String | ► |
A regular expression pattern that determines whether two words can be joined together to form a phrase. The 'Join Pattern' property controls how words are assembled into phrases (N-grams) by specifying what characters or whitespace are allowed between them. Only words separated by content matching this pattern are considered joinable. How It Works
Impact
ExamplesFor the text
Usage Scenarios
> Use diagnostics to review which word pairs were joinable and to tune the pattern for your documents. |
||||||||||||||||||||
| Term Options | Group Options[] | ► |
Configures per-term lookup and normalization options for each word in a phrase (N-gram). The 'Term Options' property allows you to specify individual lookup, filtering, and normalization settings for each term (word) in a phrase. This enables fine-grained control over how each word in a multi-word phrase is validated, corrected, or standardized. How It Works
Impact
ExamplesUsage Scenarios
> Use diagnostics to review which terms were validated or normalized for each phrase. |
||||||||||||||||||||
| Phrase Lookup | Value Lookup | ► |
An optional lookup used to validate, normalize, or correct the entire phrase (N-gram) after assembly. The 'Phrase Lookup' property allows you to apply vocabulary, exclusion, or fuzzy matching logic to the full phrase after it has been assembled from individual terms. This is useful for enforcing allowed multi-word values, correcting phrase-level errors, or mapping phrases to standardized forms. How It Works
Impact
ExamplesUsage Scenarios
> Use diagnostics to review which phrases were validated, corrected, or excluded by the lookup. |
||||||||||||||||||||
| Minimum Term Hits | Int32 | ► |
Specifies the minimum number of term lookups that must succeed for a phrase to be considered valid.
0
The 'Minimum Term Hits' property controls how many individual terms in a phrase (N-gram) must pass their respective lookups or validations for the entire phrase to be included in the output. This is useful for scenarios where partial matches are acceptable, or where some terms may be optional or prone to errors. How It Works
Impact
ExamplesFor a trigram phrase such as
Usage Scenarios
> Use diagnostics to review how many phrases were included or excluded based on term hits, and adjust this property to balance recall and precision for your scenario. |
||||||||||||||||||||
| Output | |||||||||||||||||||||||
| Output Format | String | ► |
An optional format string that transforms the final output value for each phrase. The 'Output Format' property allows you to customize how extracted phrases are presented by specifying a format string with placeholders for each term in the N-gram. This is useful for reordering terms, adding punctuation, or producing output in a specific style required for downstream processing or export. How It Works
Impact
ExamplesFor a trigram match on For a bigram match on Usage Scenarios
> Use diagnostics to review formatted output and ensure the format string produces the desired results. |
||||||||||||||||||||
| Result Filter | Result Filter | ► |
Defines rules for filtering the result set produced by extraction operations. The Result Filter allows you to configure a set of criteria that each Data Instance must meet to be included in the final result set. Results that do not match the specified conditions are excluded, enabling precise control over which values are retained for downstream processing or export. Configuration and Usage
Typical Scenarios
Related Types
For more information, see the documentation for Data Instance, Value Extractor, and Result Filter properties. |
||||||||||||||||||||
| Result Set Options | Result Set Options | ► |
Configures post-processing options for a set of extracted results, enabling value normalization, confidence adjustment, sorting, filtering, and other result set controls. The Result Set Options class provides a flexible set of controls for shaping the output of data extraction and classification activities in Grooper. It allows you to define how individual results are adjusted, how the overall result set is filtered or ordered, and how output values are normalized for downstream use. OverviewUse this class to:
These options are commonly configured on Data Fields, Data Types, or other extraction elements to ensure that the output meets business requirements and is ready for validation, export, or further processing. Key Scenarios
Processing FlowWhen applied, the options in this class are processed in a defined sequence:
This ensures that the final output is both clean and conforms to the requirements of downstream consumers. Usage Guidance
|
||||||||||||||||||||
See Also
Value LookupGroup OptionsEnvironment OptionsText PreprocessorResult FilterResult Set Options
Used By
Document TypeExtract FromData ColumnData FieldLexicalRules-BasedSpell CorrectorAuto Complete SettingsParagraph MarkerMetadata OptionsOCR LayerLine Periodicity DetectorFixed WidthLabeled ValueSelect PageData TypeOCR ReaderDividerAnchorSimple
