- Overview
- Help Status
- 
        Activity
        - Attended Activity
- 
    Code Activity
      - Apply Rules
- Attach
- Batch Transfer
- Burst Book
- Classify
- Clip Frames
- Convert Data
- Correct
- Deduplicate
- Detect Frames
- Detect Language
- Detect Language (Legacy)
- Dispose Batch
- Execute
- Export
- Extract
- Fill Data
- GPT Embed
- Image Processing
- Initialize Card
- Launch Process
- Mark Attachments
- Merge
- Recognize
- Redact
- Remove Level
- Render
- Route
- Send Mail
- Separate
- Spawn Batch
- Split Pages
- Split Text
- Text Transform
- Train Lexicon
- Translate
- XML Transform
 
 
- 
        Article
        - AI Assistants
- AI Powered Features
- Batch Processing Workflows
- Document Scanning
- PDF Processing
- Search Overview
 
- Attachment Type
- Behavior
- Capture Device
- Classify Method
- CMIS Binding
- CMISQL Element
- Collation Provider
- 
        Command
        - Action List - Create Copy Actions
- AI Chat
- Attachment Type
- Batch
- 
      Batch Folder
        - Batch Folder - Add To Index
- Batch Folder - Assign Document Type
- Batch Folder - Classify Command
- Batch Folder - Collapse
- Batch Folder - Edit Type Assignment
- Batch Folder - Extract
- Batch Folder - Group Children
- Batch Folder - Insert Control Sheets
- Batch Folder - New Text Document
- Batch Folder - Remove From Index
- Batch Folder - Remove Level
- Batch Folder - Revert To Loose Pages
- Batch Folder - Set Field Value
- Batch Folder - Sort Children
 
- Batch Object
- Batch Page
- Batch Process
- CMIS Connection
- CMIS Document Link
- CMIS Export Map - Auto Map
- CMIS Folder Link
- CMIS Import Map - Auto Map
- CMIS Repository - Reset
- CMIS Type Definition - Generate Local Type
- Column Map - Auto Map
- Content Link - Remove Link
- 
      Content Type
        - Content Type - Clean Overrides
- Content Type - Create Data Model
- Content Type - Create Local Resources Folder
- Content Type - Create Search Index
- Content Type - Delete Search Index
- Content Type - Generate Control Sheets
- Content Type - Purge Training
- Content Type - Rebuild Training
- Content Type - Submit Indexing Job
 
- Convert Data - Create Convert Actions
- Copy Base
- Data Connection
- Data Element - Remove Overrides
- Data Field Container
- Data Type - Convert To Value Reader
- EDI File
- Excel Document - Convert to CSV
- Exchange - Rebuild Search Folder
- Field Class - Purge Training
- File Store
- File System Link
- Folder - Remove Empty Subfolders
- FTP Link
- HTML Document
- HTTP Link
- JSON File
- Lexicon
- Machine - Tune File System
- Mail Link
- Mail Message
- Node
- OAuth Client Credentials - Test
- Object Library
- PDF Document
- Project - Remove Empty Subfolders
- PST File - Burst
- PST Link - Load Content
- Resource File
- Root
- Saved Query
- Search Index - Generate Subsets
- SFTP Link
- Text Document
- Value Reader - Convert To Data Type
- vCard - Expand Photo
- Word Document - Convert to PDF
- XML File
- ZIP Archive
- ZIP Link - Load Content
 
- Content Link
- Data Action
- Data Instance
- Export Definition
- Export Format
- Grooper Command Console (GCC)
- Import Definition
- Import Provider
- 
        IP Command
        - Adjust Saturation
- Adjust Tint
- Analyze Photo
- Auto Adjust Levels
- Auto Color Balance
- Auto Convert
- Auto Deskew
- Auto Orient
- Auto QA
- Auto White Balance
- Barcode Detection
- Binarize
- Blank Page Detection
- Border Detect
- Box Detection
- Brightness Contrast
- Classify Image
- Color Detection
- Color Dropout
- Color Stamp Detection
- Colorize
- Compute Moments
- Contrast Stretch
- Convert
- Corner Detection
- Crop
- Dilate Erode
- Edge Detection
- Execute Profile
- Extract Channel
- Extract Features
- Extract Page
- Feature Dropout
- Filter
- Gamma Adjust
- Histogram
- Hough Lines
- Invert
- Line Detection
- Measure Entropy
- Mirror
- Negative Region Removal
- OCR Cleanup
- Patch Code Detection
- Posterize
- Projection Profile
- Randomize Defects
- Resize
- Rotate
- Shade Removal
- Shape Detection
- Solarize
- Sticky Note Detection
- Swap Channels
- Undistort
- Warp
 
- Lookup Specification
- Measurement
- 
        Node
        - AI Assistant
- Batch Object
- Batch Process
- Batch Process Step
- CMIS Connection
- CMIS Repository
- Content Type
- Control Sheet
- Data Connection
- Data Element
- Data Rule
- Extractor Node
- File Store
- Folder
- IP Element
- Lexicon
- Machine
- Object Library
- OCR Profile
- Project
- Resource File
- Root
- Scanner Profile
- Separation Profile
- Training Page
- Work Queue
 
- OCR Engine
- 
        Property Converter
        - Auto Deskew - Precision Converter
- Base Multi Culture Converter
- Blank Zero Converter
- Check List Converter
- CMIS Export Map - Column Converter
- CMIS Folder Reference - Converter
- CMIS Import Map - Field Converter
- CMIS Object - Choice Converter
- Code Expression - Converter
- Collection Converter
- Column Map - Column Converter
- Content Type - Unlimited Converter
- Degrees Converter
- Execute Command - Link Name Converter
- 
    Expandable Converter
      - Base Culture Converter
- Batch Name Settings - Converter
- Border - Converter
- 
    Choice Converter
      - Activity Processing - Queue Converter
- AI Chat Filter - Index Converter
- AI Search - Api Version Converter
- Apply Image Cleanup - Ip Profile Converter
- Azure OCR - Api Version Converter
- Azure OCR - Model Version Converter
- Barcode Extractor - Output Group Converter
- Base Combining Provider - Group Name Converter
- Batch - Step Converter
- Batch Process - Queue Converter
- Batch Process Step - Processing Scope Converter
- Batch Process Step - Queue Converter
- Batch Transfer - Process Converter
- Batch Transfer - Repository Converter
- Batch Transfer - Step Converter
- Build Fine Tuning File - Fill Method Converter
- Chat Filter - User Id Converter
- Chat Filter - User Name Converter
- Classify - Classification Level Converter
- Classify - Output Level Converter
- CMIS Export - Creatable Child Type Converter
- CMIS Export - Creatable Folder Converter
- CMIS Type Reference - Cmis Type Converter
- CMISQL Element - Qrderable Property Converter
- CMISQL Element - Queryable Property Converter
- CMISQL Element - Selectable Property Converter
- CMISQL Query - Primary Type Converter
- ColorTrac Scanner - Resolution Converter
- Comparison Filter - Function Name Converter
- Comparison Filter - Operand Type Converter
- Comparison Filter - Value Type Converter
- Comparison Predicate - Comp Op Converter
- Comparison Predicate - Value Converter
- Data Element - Display Label Converter
- Data Field - Sub Element Converter
- Database Table - Extended Property Name Converter
- Database Table - Table Name Converter
- EDI Schema Importer - X12Schema Converter
- Fill - Fill Method Converter
- Fill Data - Name Converter
- Fill Descendants - Name Converter
- Flag Item - Flag Reason Converter
- Generate Local Type - Doc Type Property Converter
- Import Provider - Disposition Converter
- Import Repository - Repository Converter
- ISIS Device - Device Name Converter
- Join Clause - Secondary Type Converter
- Label Info - Parent Label Converter
- Lexicon Lookup - Lookup Field Converter
- Lexicon Lookup - Target Field Converter
- Nested Table - Table Converter
- ODBC - Pg Odbc Dsn Converter
- Pattern-Based - Group Name Converter
- PDF Data Mapping - Font Name Converter
- Predicate List - Logical Operator Converter
- Read Metadata - Property Name Converter
- Reference - Group Name Converter
- Regular Expression - Group Converter
- Remove From Index - Index Name Converter
- Remove Overrides - Property Name Converter
- Reset - Step Converter
- Root - License Url Converter
- Route Definition - Process Converter
- Run Step - Step Converter
- Schema Mapping - Schema Name Converter
- Search Index - Index Name Converter
- Search Index Query - Index Name Converter
- Send To Test Batch - Flag Reason Converter
- Set Field Value - Value Converter
- String - Pdf Font Name Converter
- Task Filter - Activity Name Converter
- Task Filter - Process Name Converter
- Task Filter - Queue Converter
- Task Filter - Step Name Converter
- Text Document - Encoding Converter
- Text Document - Normalize Encoding Converter
- TWAIN Device - Compression Mode Converter
- TWAIN Device - Device Name Converter
- Update Process - Process Converter
- Update Process - Step Converter
- Value Selector - Target Field Converter
- XML Value Selector - Target Field Converter
 
- Double Range - Double Range Converter
- Expandable Info Converter
- Integer Range - Integer Range Converter
- Logical Border - Arrow Converter
- Logical Border - Logical Border Converter
- Logical Point - Logical Point Converter
- Logical Rectangle - Logical Rectangle Converter
- Logical Size - Logical Size Converter
- On Off Converter
- Percent Range - Percent Range Converter
- Point ExF - Converter
- Rectangle - Converter
- 
    Type Selector
      - CMISQL Query - Where Element Converter
- Data Connection - Connection Converter
- Exchange - Auth Method Converter
- Execute - Command Converter
- Execute Activity - Activity Converter
- Execute Command - Command Converter
- Run Activity - Activity Converter
- SharePoint - Auth Method Converter
- Storage Type - Converter
- Web Service Lookup - Auth Method Converter
 
- Unit Border - Unit Border Converter
- Unit Line Length - Unit Line Length Converter
- Unit Point - Unit Point Converter
- Unit Range - Unit Range Converter
- Unit Rectangle - Converter
- Unit Size - Unit Size Converter
- Value Extractor - Converter
 
- JPEG 2000 - Ratio Converter
- Logical Value - Simple Value Converter
- Logical Value - Universal Value Converter
- Node Information - Value Converter
- Page Filter Converter
- Path Expression - Converter
- Percent Converter
- Pg Dictionary Converter
- Pg Flags Converter
- Pg Ref Collection Converter
- Pg String Collection Converter
- Pg Type Display Name Converter
- Product License - Quantity Converter
- Read Only Converter
- Rectangle - Inches Converter
- 
    Simple Converter
      - Click to Edit Converter
- Data Action - Source Element Converter
- Data Action - Target Element Converter
- IN Predicate - In Predicate Values Converter
- OAuth Authentication - Login Converter
- Pattern Match - Group Options Converter
- Pg Format Converter
- Product License - Quantity Used Converter
- Project - Projects Converter
- Publish To Repository - Repository Converter
- Result Set Options - Sort Order Converter
- Review - View List Converter
- Stats Query - Name List Converter
- String - Pg Text Lines Converter
- Type Permissions - Command Converter
- Word Match - Term Options Converter
 
- Text Rendering - Size Converter
- Time Range Converter
- Time Ranges Converter
- Time Span Converter HMS
- Timer Service - Time Converter
- Times Converter
- Unit Value - Unit Value Converter
- Word Match - Integer Range Converter
 
- 
        Property Editor
        - Anchor Definition - Location Editor
- Barcode Detected - Preview Image Editor
- 
    Choice Property Editor
      - Azure Document Intelligence OCR - Model Editor
- Base Culture Editor
- 
    Check List Editor
      - AI Assistant - Search Index Editor
- AI Table Reader - Included Columns Editor
- Batch Filter - Activity Editor
- Batch Filter - Process Editor
- Batch Filter - Status Editor
- Batch Filter - Step Editor
- CMIS Type Reference - Secondary Types Editor
- Data Fill Method - Included Children Editor
- Delete Fine Tuned Model - Models Editor
- Generate Local Type - Property Check List
- IMAP - folder Editor
- Publish To Repository - Repository Editor
- Rebuild Indexes - Table Names Editor
- Reset - Step Checklist Editor
- Stats Query - Name List Editor
- Table Mapping - Column Check List
- Text Analysis - Entity Type Editor
- Type Permissions - Command Editor
 
- Data Connection - Table Name Editor
- Delete Fine Tuned Model - Model Editor
- GPT Embed - Embeddings Model Editor
- LLM Connector - Chat Model Editor
- LLM Connector - Embeddings Model Editor
- Return Value - Column Editor
- SQL Server - Database Name Editor
- Start Fine Tuning Job - Model Editor
 
- CMIS Compound Type - Editor
- CMISQL Query - Query Editor
- 
    Code Property Editor
      - AI Chat Filter - Filter Editor
- AI Schema Extractor - Schema Editor
- Ask AI - Schema Editor
- 
    Code Expression Editor
      - Batch Process Step - Next Step Editor
- Batch Process Step - Should Submit Editor
- Calculate Value - Value Expression Editor
- CMIS Export Map - Expression Editor
- CMIS Import Map - Expression Editor
- Code Expression - Editor
- Column Map - Expression Editor
- Computed Field - Expression Editor
- Concat - Trigger Editor
- Content Type - Caption Editor
- Copy Base - Trigger Editor
- Custom Statement - Statement Editor
- Data Export - Alternate Database Editor
- Data Field - Default Value Editor
- Data Field - Field Expression Editor
- Data Rule - Trigger Editor
- Data Section - Caption Editor
- Expression Set - Default Value Editor
- Expression Set - Field Expression Editor
- IP Element - Next Step Editor
- IP Element - Should Execute Editor
- Lookup Specification - Trigger Editor
- Metadata Options - Value Editor
- Path Expression - Editor
- Raise Issue - Log Message Editor
- Remove - Trigger Editor
- Require Value - Log Message Editor
- Text Transform - Record Editor
- Variable Definition - Expression Editor
 
- Create Table - Statement Editor
- Data Field Container - Css Editor
- Database Lookup - SQL Query Editor
- Embedded Lexicon - Local Entries Editor
- Json Property Editor
- KVP Editor
- Lexicon - Lexicon Link Code Editor
- List Match - Local Entries Editor
- Mail Import - IMAP Query Editor
- Pattern Match - Output Format Editor
- Regex Property Editor
- Search Classifier - Filter Editor
- Search Index - Filter Editor
- Search Index Query - Filter Editor
- Search Index Query - Order By Editor
- Search Index Query - Search Editor
- Send Mail - Template Editor
- String List Editor
- Submit Indexing Job - Select Editor
- Subset Filter - Filter Editor
- Text Property Editor
- Web Service - Header Editor
- Web Service Lookup - Post Data Editor
- Web Service Lookup - Url Editor
- Word Match - Output Format Editor
- XML Lookup - Selector Editor
- XML Transform - Transform Editor
- XML Value Selector - Path Editor
 
- Folder Browse Editor
- LDAP - ACL Editor
- OAuth Authentication - Login Editor
- Object Collection Editor
- Object Properties Editor
- Open File Editor
- 
    Reference Editor Base
      - 
    Node Reference Editor
      - Archive - Folder Editor
- Batch Process Step Editor
- Content Type Editor
- Custom Statement - Scope Editor
- Data Action - Action Element Editor
- Data Action - Source Editor
- Data Action - Target Editor
- Data Field Container - Rule Editor
- Data Rule - Scope Editor
- Database Cleanup - Folder Editor
- Dispose Batch - Target Folder Editor
- Execute Rule - Rule Editor
- Field Match - Field Editor
- Generate Subsets - Field Editor
- Grid Layout - Header Column Editor
- Piece Info Options - Key Column Editor
- Piece Info Options - Value Column Editor
- Return Value - Field Editor
- Set Field Value - Field Editor
- Table Mapping - Scope Editor
- Task Filter - Batch Editor
- Test Batch Editor
- Text Transform - Scope Editor
- Train Lexicon - Scope Editor
- Virtual Table Definition - Collection Editor
- Web Service - Definition File Editor
 
- Ordered Reference Editor
- 
    Reference List Editor
      - All Nodes Reference Editor
- Behavior - Field List Editor
- Bookmark Options - Data Element Editor
- Build Fine Tuning File - Batch Editor
- Content Types Editor
- Correct - Fields Editor
- Data Fill Method - Included Descendants Editor
- Data Model - Style Sheets Editor
- Data Rule - Required Elements Editor
- Data Values - Included Elements Editor
- Extract - Data Element Filter Editor
- Indexing Behavior - Included Elements Editor
- JSON Data Mapping - Included Elements Editor
- Lexicon - Lexicons Editor
- Piece Info Options - Element Editor
- Project - Projects Editor
- Redact - Extractors Editor
- Redact - Fields Editor
- Require Value - Required Elements Editor
- Section Extract Method - Included Descendants Editor
- Thumbnail View - IP Profiles Editor
- Transaction Detection - Field List Editor
 
 
- 
    Node Reference Editor
      
- Sample Image Collection - Editor
- Value Extractor - Editor
- Zone Editor
 
- Schema Importer
- Section Extract Method
- Separation Provider
- Service Instance
- Storage Type
- Table Extract Method
- Task View
- 
        UI Element
        - 
    Control
      - Active Task List
- AI Helper
- Batch Info Tab
- Batch Info Viewer
- Batch List
- Batch Manager
- Candidate List
- Card List
- Chat Console
- Class Help
- CMIS Repository Searcher
- CMIS Tree Browser
- CMIS Type Tree
- Code Editor
- Complete List
- Content Viewer
- Context Menu
- Conversation Viewer
- Data Element Tester
- Data Grid
- Data Grid Document
- Data Grid Element
- Data Inspector
- Data Tree
- 
    Design Tab
      - AI Assistant - Chat History
- Batch
- Batch Folder - General
- Batch Page - General
- Batch Process
- Batch Process Step
- CMIS Connection - General
- CMIS Repository
- Content Type
- Control Sheet - General
- Data Connection - General
- Data Element
- Data Rule - Tester
- Extractor Node - Tester
- Field Class - Weightings
- Folder - Batches
- IP Element Container - Tester
- IP Step - Tester
- Lexicon - General
- Machines
- Node
- OCR Profile - Tester
- Processing Queue - Workers
- Project - Usage
- Resource File - General
- Root
- Training Page - General
 
- Design Tab Host
- Diagnostics Viewer
- Document Searcher
- Document Viewer
- Expression Grid
- Extractor Builder
- FRX Grid
- FRX Visualizer
- Image Editor
- Image Print Preview
- Image Viewer
- Instance Searcher
- Label Set Editor
- List Searcher
- Lookup Fields
- Lookup Results
- Node Finder
- Node Report
- Object List
- OCR Viewer
- Page Navigator
- Profile Browser
- Property Grid
- 
    Property Grid Editor
      - ACL Editor
- Anchor Editor
- Choice Editor
- CMIS Query Editor
- Code Property Editor
- Collation Editor
- Collection Editor
- Extractor Property Editor
- Folder Editor
- List Editor
- Multi Reference Editor
- OAuth Log-in Editor
- Object Editor
- Ordered Reference Editor
- Preview Image Editor
- Reference Editor
- Sample Image Editor
- Zone Editor
 
- Property Help
- Query Editor
- Query Helper
- Query List
- Recognition Tester
- Rep Info Panel
- Review Tab
- Search Result Cards
- Separation List
- Service Collection
- Splitter
- Stats Report
- Stats Result Set
- Stats Viewer
- Tab List
- Task List
- Test Source
- Tree Viewer
- Upload Dialog
- Weightings List
 
- Web Page
 
- 
    Control
      
- Value Extractor
- Variable Provider
- 
      Other Configuration Types
      - API Key
- Archive Info
- Border
- Capture Settings
- Character Class Filter
- Chat Parameters
- CMIS Object
- CMIS Property Definition
- Code39Settings
- 
    Connected Object
      - Batch Filter
- Chat Filter
- Database Row
- 
    Embedded Object
      - AI Chat Filter
- AI Chat Settings
- AI Generator
- Anchor Definition
- Attachment Rule
- Auto Complete Settings
- Barcode Reader
- Batch Creation Settings
- Batch Name Settings
- Bookmark Options
- Bot Connector
- Boundary Detector
- Chunk Settings
- Cluster Parameters
- CMIS Export Map
- CMIS Folder Reference
- CMIS Import Map
- CMIS Type Definition
- CMIS Type Reference
- Code Expression
- Column Map
- Command Options
- Computed Field
- Content Mapping
- Custom Statement
- Data Element Extension
- Data Element Profile
- Data Fill Method
- Data Generator
- Edge Adjustment
- Embedded Lexicon
- Environment Options
- Execute Step
- Expression Set
- Field Annotation
- Field Mapping
- File Reference
- Folder Level Info
- FRX Options
- FTP Repository Configuration
- Fuzzy Lookup Options
- Horizontal Tab Marker
- HTTP Auth Method
- HTTP Resource
- Hyperlink Selector
- Image Segmentation Options
- Import Schedule
- Index Stats
- Label Info
- Label Set
- Label Version
- Layout Provider
- Line Periodicity Detector
- LLM Provider
- Lucene Query
- Metadata Options
- Multiline Row Settings
- OCR Layer
- OCR Repair Options
- OMR Box
- Page Attachment Rule
- Paragraph Marker
- Path Expression
- PDF Expand Method
- Permission Set
- Piece Info Options
- Quoting Method
- Region Definition
- Repository Configuration
- Repository Option
- Resource Reference
- Result Filter
- Result Processor
- Result Set Options
- Return Value
- Route Definition
- Sample Image Collection
- Schema Mapping
- Search Filter
- Separate Action
- Service Deployment
- Service Stats
- Stats Query
- Subset Filter
- Table Header Detector
- Table Mapping
- Table Row Detector
- Text Preprocessor
- Transaction Extractor
- Type Permissions
- Value Lookup
- Value Selector
- Variable Definition
- Vector Search Options
- Vertical Tab Marker
- Virtual Table Definition
- XML Value Selector
 
- Node Query
- Purge Folder
- Search Index Query
- Task Filter
 
- Constrained Wrap Options
- Culture Data
- Dash Detector
- Database Connection Settings
- Defect Generator
- Double Range
- Dropout Method
- Event Filter
- Fiche Card Layout
- Folder Level Options
- Horizontal Alignment Settings
- HTTP Authentication Method
- Image Compression
- Image Info
- Integer Range
- Line Snap Options
- Margin Detector
- Multi Line Settings
- Node Information
- PDF Burst Settings
- PDF Page Generator
- PDF Render Settings
- Percent Range
- Rectangle
- Region Detector
- Regular Expression
- Remote Repository
- Row Alignment Settings
- Scan Once Settings
- Semantic Quoting Query
- SFTP Repository Configuration
- Shell Execute Info
- Sort Specification
- System Config
- Text Wrap Options
- TIFF Page
- Transaction Layout Detection
- Vertical Wrap Detection
 
- Advanced Topics
- 
      Enumerations
      - 
      Grooper
      - CharacterCasing
- ConcurrencyMode
- DatabaseStatus
- DCTModes
- EventType
- NodeAttributes
- Pages
- PixelFormat
- ProcessingScope
- ProcessingStatus
- ResultOrder
- SimplePixelFormat
 
- 
      Grooper.Activities
      - ActionType
- BatchDisposition
- BatchNameSuffixEnum
- BodyRenderingMethod
- ComparisonMode
- DuplicateDisposition
- ExecuteType
- ExecutionScope
- ExtractMode
- FilterType
- MatchActions
- OcrAssistMode
- PageExtractMode
- ProblemDisposition
- ReclassifyModes
- RepairScope
- RouteMethod
- SaveDisposition
- SharedBehaviorModes
- SpawnMethod
- StatsLoggingMode
- TextExtractMode
- TrainingScope
- XmlSource
- XmlTarget
 
- 
      Grooper.Capture
      - FeedOrientation
- ImportType
- MissDispositionEnum
- PageDirection
- ScanningSpeed
- TwainCompressionModes
 
- 
      Grooper.Capture.ColorTrac
      - ColorFormat
- PageSizeMode
- PaperEndCondition
- PaperJustification
- ScanSpeed
- StandardPageSize
 
- 
      Grooper.Cloud
      - ApiRegionEnum
- ContentLayout
- HttpVerbs
- MessageFormats
- MetadataModes
- TranslateDisposition
 
- 
      Grooper.CMIS
      - AuthenticationProvider
- CmisProtocol
- ImportModes
- LoadScope
- NamingMethods
- OrderByDirection
- TransferScopes
 
- 
      Grooper.Core
      - ActivateModes
- ArrayActions
- AttachmentPosition
- BrowserSuggestMode
- CalculateModes
- CalculateModes
- CaptureScope
- ClassificationLevel
- CompareMode
- ConflictDispositions
- ConflictResolution
- ControlCharacters
- CreateModes
- DedupMode
- DispositionType
- DuplicateFilenameResolution
- FolderRelativePosition
- FolderRelativePosition
- FooterModes
- FormatOptions
- FuzzyMatchMode
- GroupingColumn
- IdfModes
- IssueDisposition
- JsonLayout
- LexiconType
- MergeModes
- MissDispositions
- MissDispositions
- NumberFormats
- OxiElement
- PaginationType
- ParagraphOptions
- PdfBuildOptions
- PopulationMethod
- ProcessingLevel
- PropagationMode
- SegmentType
- SortColumns
- SortDirection
- SortDirections
- SortDirections
- SortOption
- SortOrder
- StandardWeightings
- TabOptions
- TaskScope
- TfModes
- TimeFrames
- TimeGrouping
- TrainingScopes
- TriggerModes
- TypeKind
- TypeModes
- TypeOperation
- UserTrainingMode
- ValueInterpretations
- ZIPDispositions
 
- 
      Grooper.EDI
      - AttachmentNamingMethods
- DataDisposition
- NamingMethods
- NamingMethods
 
- 
      Grooper.Extract
      - AdjustmentMethod
- AlignmentMode
- CollationType
- CombineType
- CompassDirection
- ConfidenceModes
- ContextScopes
- CultureScopes
- ExecutionScope
- FlowDirection
- GroupingType
- HorizontalDataAlignment
- HorizontalDataAlignment
- LabelLayout
- LookupOption
- MappingType
- OmrBoxDirection
- OmrFlowDirection
- OmrMode
- OutputValueOptions
- ReadDirection
- ReadMethods
- ReferencePointPosition
- ROIModes
- RowDetectionMode
- RowMatchOptions
- SecondaryExtractMethod
- SecondaryExtractTrigger
- SplitPositionEnum
- TableRowAlignment
- TableStyles
- VerticalDataAlignment
- WordTransform
 
- 
      Grooper.GPT
      - AuthorizationMethod
- BooleanOperator
- BuiltInFieldKinds
- DocumentLinkingOptions
- FieldAlignMode
- IndexOperations
- LambdaFunction
- LayoutComponentTypes
- OperationType
- QueryTypes
- ReasoningEffortLevels
- ResultOrder
- RetrievalOptions
- RowAlignMode
- SearchModes
- SectionAlignMode
- ServiceTiers
- VerbosityLevels
 
- 
      Grooper.IP
      - AdaptiveKernelType
- AngleCategory
- Axis
- BinarizationMethod
- ChannelNumber
- Code39Options
- ColorSpaceType
- CombDetectionType
- CompressionMode
- Connectivity
- CropMethod
- CurveType
- DetectMethod
- FeatureType
- FillMethod
- FilteringLevel
- FilterTypeEnum
- HarrisFilterType
- HoughLevel
- ImageEdges
- InpaintMethod
- MaskShape
- MaskSize
- MeasurementType
- Method
- OneDimSymbology
- OperationType
- OperationType
- Pdf417Options
- PostSymbology
- ProcessingResolution
- ProgressionOrder
- ReadDirection
- ReadingQuality
- ResizeInterpolationMode
- SizeMethod
- Symbology
- TwoDimSymbology
- WarpInterpolationMode
 
- 
      Grooper.Messaging
      - BodyHandling
- Orientation
- PaperKind
- SaveAction
- SelectorKind
 
- 
      Grooper.OCR
      - AccuracyLevels
- BaseCharacterSetEnum
- DetectionMethod
- EngineModeEnum
- FontPitchMode
- LexMode
- PageOrientation
- PageOrientation
- SegmentationModeEnum
- SynthesisMethodEnum
 
- 
      Grooper.Office
      - SaveMethod
 
- 
      Grooper.PDF
      - CompressionMode
- ImageLayout
- PDFAComplianceLevels
- PdfBorderStyle
- PdfDisplayMode
- PdfPermissions
- PdfViewerOptions
- SearchableTextFormat
- TargetColorFormat
 
- 
      Grooper.Services
      - DaysOfWeek
 
- 
      Grooper.Services.CMIS
      - ConnectMethod
- ContentMode
- FileType
- FormOverlayType
- MergeAction
 
- 
      Miscellaneous
      - BaseTypeId
- CharacterCasing
- CompressionLevel
- ContentAlignment
- DateTimeStyles
- FileAttributes
- FontStyle
- Formatting
- HorizontalAlignment
- Keys
- NumberStyles
- RegexOptions
- ThreadPriority
- UriKind
 
 
- 
      Grooper
      
Text Preprocessor
Applies configurable text preprocessing to a document's content before regular expression extraction.
Remarks
The Text Preprocessor enables advanced manipulation of control characters in a document's text, allowing regular expressions to match or ignore structural elements such as line breaks, paragraph boundaries, page breaks, tabs, and spaces.
Overview
Text preprocessing is performed immediately before extraction, transforming the document's text to improve the accuracy and flexibility of pattern matching. This is especially useful when data values span multiple lines, are separated by large whitespace gaps, or are affected by inconsistent formatting.
Key Features
- 
Paragraph Marking: 
 Detects paragraph boundaries and converts line breaks within paragraphs to spaces, while preserving paragraph-ending breaks. This allows extractors to match values that span multiple lines within a paragraph, without matching across paragraph boundaries. See Paragraph Marker.
- 
Tab Marking: 
 Replaces large horizontal whitespace gaps with TAB characters, making it possible to distinguish between normal spaces and significant gaps in regular expressions. See Horizontal Tab Marker.
- 
Vertical Tab Marking: 
 Converts certain line breaks to vertical tab characters based on vertical spacing, enabling recognition of vertical structure in tabular or multi-column layouts. See Vertical Tab Marker.
- 
Control Character Ignoring: 
 Removes or replaces selected control characters (such as spaces, newlines, form feeds, and carriage returns) according to the 'Ignore Control Characters' setting. This can simplify extraction in documents with inconsistent or excessive whitespace.
Usage Guidance
- Configure the desired preprocessing options by enabling or disabling paragraph, tab, and vertical tab marking, and by selecting which control characters to ignore.
- Preprocessing is typically used in conjunction with regular expression-based extractors, but can benefit any extraction scenario where document structure affects pattern matching.
- For best results, adjust preprocessing settings to match the structure and formatting of your source documents.
Example Scenarios
- 
Extracting values that span multiple lines within a paragraph: 
 Enable paragraph marking to convert internal line breaks to spaces, allowing regular expressions to match values split across lines.
- 
Distinguishing between normal spaces and large gaps: 
 Enable tab marking to insert TAB characters at significant horizontal gaps, so extractors can target fields separated by large whitespace.
- 
Cleaning up unwanted whitespace or control characters: 
 Use the 'Ignore Control Characters' option to remove or replace problematic characters that interfere with extraction.
For more details, see the documentation for Paragraph Marker, Horizontal Tab Marker, and Vertical Tab Marker.
Examples
1. Sample Document
Consider the following sample document.
┌─────────────────────────────────────────────────────────────┐
│                        SAMPLE FORM                          │
├─────────────────────────────────────────────────────────────┤
│ Name:           John Doe                   ID: 12345        │
│ Date of Birth:  01/01/1980                 Status: Active   │
├─────────────────────────────────────────────────────────────┤
│ This is the first paragraph. It explains the purpose of     │
│ the form and the meaning of each field.                     │
│                                                             │
│ Please complete all fields and verify all personal          │
│ information before submitting. Thank you!                   │ 
└─────────────────────────────────────────────────────────────┘
2. Default Control Characters
With no preprocessing options enabled, the document data will look like this.  Whitespace gaps, no matter how large,
are represented by a single space character.  A \r\n pair marks each location where the original document wrapped
to the next line.
SAMPLE FORM\r\n
Name: John Doe ID: 12345\r\n
Date of Birth: 01/01/1980 Status: Active\r\n
This is the first paragraph. It explains the purpose of\r\n
the form and the meaning of each field.\r\n
Please complete all fields and verify all personal\r\n
information before submitting. Thank you!\r\n
3. Preprocessed Version
Preprocessing the document with paragraph marking and tab marking will place a tab character '\t' at each large whitespace gap, and replace newline pairs '\r\n' occuring inside a paragraph with a space.
SAMPLE FORM\r\n
Name: John Doe\tID: 12345\r\n
Date of Birth: 01/01/1980\tStatus: Active\r\n
This is the first paragraph. the form and the meaning of each field.\r\n
Please complete all fields and verify all personal information before submitting. Thank you!\r\n
Properties
| Name | Type | Description | ||||||||||||||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Paragraph Marking | Paragraph Marker | ► | Detects and marks paragraph boundaries in natural language documents to improve data extraction from paragraph flow text. Can be one of the following types: 
 The Paragraph Marker is a text preprocessing component used to identify paragraph boundaries in documents, especially those containing natural language text. By marking paragraphs, it enables more accurate extraction of data that may span multiple lines within a paragraph, while preserving true paragraph breaks. PurposeParagraphs in documents often wrap across multiple lines, causing data values to be split by line breaks (CR/LF). This can make it difficult for extractors to match values that span lines, as standard extraction logic may not account for embedded line breaks within paragraphs. The Paragraph Marker solves this by detecting paragraph boundaries and converting line breaks inside paragraphs to spaces, while leaving the line break at the end of each paragraph intact. This produces a normalized text flow, making it easier to extract values that span lines. How It WorksThe Paragraph Marker processes the text of a document by analyzing each line and determining whether it should be joined with the previous line or treated as the start of a new paragraph. The main algorithm works as follows: 
 This approach ensures that wrapped lines within a paragraph are merged for extraction, while true paragraph boundaries are maintained for downstream processing. ExampleConsider the following paragraph, where the effective date is split across two lines: Without paragraph marking, an extractor searching for "February 1, 1988" would not find a match due to the embedded line break (\r\n) after 'February'. With paragraph marking enabled, the text is normalized as: Now, extractors can reliably match values that span lines within a paragraph, without overmatching across true paragraph boundaries. Configuration Guidance
 Usage Notes
 | |||||||||||||||||||||||||||
| Tab Marking | Horizontal Tab Marker | ► | Detects and inserts tab characters into text based on whitespace gaps, font size changes, or document layout features such as vertical lines and underlines. Can be one of the following types: 
 The Horizontal Tab Marker class is used to identify locations in text where a tab character ( OverviewHorizontal Tab Marker analyzes the spacing between words and other layout cues to determine where tabs should be placed. It is commonly used in text preprocessing to convert visually separated columns or fields into a tab-delimited format, making downstream data extraction and parsing more reliable. How It WorksThe Horizontal Tab Marker processes the text by analyzing the gaps between words and determining where a tab character should be inserted. The main algorithm works as follows: 
 This approach ensures that visually separated columns or fields are accurately marked with tabs, improving the reliability of downstream data extraction and parsing. Configuration Guidance
 Example 1: Field Extraction with Large Whitespace GapFor example, consider a document region containing two field values with a large whitespace gap in between, like this: 
 When text is extracted without tab marking, the large gap is represented as a single space, making it difficult to determine where one field ends and the next begins.
If you use an extractor with a pattern like  
 By enabling tab marking, the large gap is replaced with a tab character. Now an extractor looking for  
 Example 2: Table Row with Multiple ColumnsConsider a table row in a document where columns are separated by large whitespace gaps: 
 Without tab marking, the extracted text may look like: 
 This makes it difficult to reliably extract each column value. With tab marking enabled, the output will be: 
 Now, each value is clearly separated by a tab character, making column-based extraction straightforward and robust. Notes
 | |||||||||||||||||||||||||||
| Vertical Tab Marking | Vertical Tab Marker | ► | Detects and marks large vertical whitespace gaps between lines with a vertical tab character to represent vertical separation in text. Can be one of the following types: 
 The Vertical Tab Marker is a text preprocessing component that identifies significant vertical gaps between lines in a document and replaces the standard
line break (CR/LF) with a vertical tab character ( PurposeIn many documents, a large vertical gap between lines indicates a new section, a table row, or a logical break. Standard line breaks do not distinguish between normal line wrapping and these larger separations, making it difficult for downstream extractors to interpret the document's structure. The Vertical Tab Marker solves this by converting line breaks to vertical tab characters when the vertical gap between two lines exceeds the configured threshold. This allows extractors and parsers to recognize and handle vertical structure more accurately. How It Works
 This approach enables downstream extraction logic to distinguish between normal line wrapping and significant vertical separations, improving the accuracy of data extraction from structured documents. Configuration Guidance
 Usage Notes
 | |||||||||||||||||||||||||||
| Ignore Control Characters | ControlCharacters | ► | Specifies which control characters are ignored or replaced during text preprocessing. 
          Pg Flags Converter
              None
           A combination of the following flags: 
 The Control Characters enum defines options for removing or replacing specific control characters in a document's text prior to extraction. These options are used by the Text Preprocessor to clean up or normalize whitespace and line breaks, improving the reliability of pattern matching and data extraction. Multiple values can be combined to ignore several types of control characters at once. | 
See Also
Paragraph MarkerHorizontal Tab MarkerVertical Tab Marker
Used By
Field MatchLabel MatchPattern-BasedFlowList MatchPattern MatchWord MatchLabeled RegionExtractedSemanticAsk AI
