Grooper Help - Version 25.0
25.0.0017 2,127

Transym OCR 4

Transym OCR Engine (Grooper.OCR)

Provides an OCR engine for Grooper using the commercial Transym OCR 4 library.

Remarks

Overview

The Transym OCR engine integrates the Transym OCR 4 library into Grooper, offering highly accurate English-only OCR for machine-printed documents.

This engine is installed as part of Grooper and is designed for high-performance recognition of printed text in both structured and unstructured documents. It does not support handwriting or languages other than English.

Features

  • Optimized for English-language, machine-printed text.
  • Includes advanced image cleanup options such as auto-inversion, deskew, deshade, noise removal, and line removal.
  • Supports merging of broken characters and removal of questionable lines or characters.
  • Can use a lexicon to help correct misrecognized words.
  • Allows restriction of recognized characters via the 'Allowed Characters' property.
  • Supports automatic or fixed orientation detection.
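
To illustrate the general idea behind lexicon-assisted correction, here is a minimal Python sketch that snaps a misread word to its closest lexicon entry. It is not Transym's algorithm and does not use any Grooper API; the function name `correct_word`, the sample lexicon, and the similarity cutoff of 0.8 are assumptions chosen for illustration, and the matching relies on the standard library's `difflib`.

```python
import difflib


def correct_word(word: str, lexicon: list[str], cutoff: float = 0.8) -> str:
    """Return the closest lexicon entry, or the word unchanged if none is close."""
    # get_close_matches ranks candidates by similarity ratio and drops
    # anything scoring below the cutoff.
    matches = difflib.get_close_matches(word.lower(), lexicon, n=1, cutoff=cutoff)
    return matches[0] if matches else word


# Hypothetical lexicon of expected document vocabulary.
LEXICON = ["invoice", "total", "amount"]

print(correct_word("invo1ce", LEXICON))  # a digit misread in place of a letter
print(correct_word("xyz", LEXICON))      # nothing similar, so left unchanged
```

A high cutoff keeps the correction conservative: words with no close lexicon match are left as recognized rather than forced into a dictionary entry.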

Configuration

  • Enable or disable image cleanup options to optimize recognition for your document types.
  • Use 'Allowed Characters' to restrict recognition to a known set of characters for forms or codes.
  • Set 'Orientation' to 'Auto' for automatic detection, or specify a fixed orientation if documents are consistently rotated.
  • Enable 'Use Lexicon' to improve recognition of dictionary words.
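
The settings above can be pictured as a single configuration record. The following Python sketch is for illustration only: it is not Grooper's API, and the class name `TransymSettings`, its field names, the default values, and the fixed orientation values are all assumptions that loosely mirror the property names mentioned on this page.

```python
from dataclasses import dataclass

# Assumed names for fixed orientations; the real property values may differ.
FIXED_ORIENTATIONS = ("0", "90", "180", "270")


@dataclass
class TransymSettings:
    """Hypothetical model of the engine settings described above."""

    # Image cleanup options named on this page (defaults are assumptions).
    auto_invert: bool = True
    deskew: bool = True
    deshade: bool = True
    remove_noise: bool = True
    remove_lines: bool = True
    # An empty string means "no restriction" in this sketch.
    allowed_characters: str = ""
    orientation: str = "Auto"
    use_lexicon: bool = False

    def validate(self) -> list[str]:
        """Return a list of human-readable problems; empty if none."""
        problems = []
        if self.orientation != "Auto" and self.orientation not in FIXED_ORIENTATIONS:
            problems.append(f"unknown orientation: {self.orientation!r}")
        if len(set(self.allowed_characters)) != len(self.allowed_characters):
            problems.append("allowed_characters contains duplicates")
        return problems


# Example: numeric codes on scans that are consistently rotated.
codes = TransymSettings(allowed_characters="0123456789-", orientation="90")
print(codes.validate())
```

Writing the settings down this way makes it easier to compare profiles for different document types, for example a restricted character set for coded forms versus an unrestricted, lexicon-enabled profile for correspondence.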

Integration

Transym OCR is typically selected as the OCR engine within an OCR Profile. It is suitable for both full-page and zonal OCR, and is often used for high-volume, production-grade document processing.

Best Practices

  • Use the default image cleanup options for most documents; disable an option only if you observe unwanted side effects from it.
  • Restrict 'Allowed Characters' for forms or documents with limited character sets to improve accuracy.
  • For best results, use high-quality, clean scans with consistent orientation.
  • Review the output for questionable lines or characters and adjust filtering options as needed.
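
When reviewing output from a restricted character set, it can help to flag anything that falls outside the set you expected. The sketch below is a standalone Python helper, not part of Grooper; the function name `disallowed_characters` and the sample values are assumptions for illustration.

```python
def disallowed_characters(text: str, allowed: str) -> list[str]:
    """Return distinct non-whitespace characters in `text` that are not in
    `allowed`, in order of first appearance."""
    allowed_set = set(allowed)
    found: list[str] = []
    for ch in text:
        if ch.isspace() or ch in allowed_set or ch in found:
            continue
        found.append(ch)
    return found


# Hypothetical field that should contain only digits and hyphens.
ALLOWED = "0123456789-"

# Letters such as 'O' and 'l' are common confusions for '0' and '1'.
print(disallowed_characters("4O7-1l2", ALLOWED))
```

An empty result means every recognized character belongs to the expected set; a non-empty result points to fields worth a second look or to a character set that is too narrow.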

Limitations

  • English language only.
  • Does not support handwriting or complex scripts.
  • Maximum image dimension is 12,000 pixels in width or height.
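
The size limit can be checked before images are submitted for OCR. The following Python sketch assumes the 12,000-pixel limit is inclusive and shows one way to compute a downscale factor for oversized images; the function names are hypothetical, and how Grooper itself handles oversized images is not covered here.

```python
MAX_DIMENSION = 12_000  # pixels, per the limitation stated above


def fits_engine_limit(width: int, height: int) -> bool:
    """True if neither dimension exceeds the stated maximum."""
    return width <= MAX_DIMENSION and height <= MAX_DIMENSION


def downscale_factor(width: int, height: int) -> float:
    """Scale factor (at most 1.0) that brings the longer side within the limit."""
    longest = max(width, height)
    if longest <= MAX_DIMENSION:
        return 1.0
    return MAX_DIMENSION / longest


# Example: a large-format scan that is too wide.
width, height = 15_000, 9_000
factor = downscale_factor(width, height)
print(fits_engine_limit(width, height), round(width * factor), round(height * factor))
```

Downscaling lowers the effective resolution, so it is worth confirming that small text remains legible after resizing.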

Properties

Name    Type    Description
Image Cleanup
Document Structure
Processing Options
