Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Blank Page Detection

IP Command Grooper.IP

Detects whether an image is blank for the purposes of automated page deletion, separation, or classification.

Remarks

The Blank Page Detection command analyzes a document image to determine if it is blank or contains meaningful content. This is essential for workflows that require the removal of blank pages, separation of document batches, or classification of pages as blank for downstream processing.

The command works by evaluating the amount and distribution of non-background pixels, optionally after binarization and preprocessing. It supports advanced features such as border exclusion, noise filtering, and configurable thresholds to accommodate a wide range of document types and scanning conditions.

How Blank Page Detection Works

  1. The image is binarized (if configured) to separate foreground from background.
  2. Optional preprocessing steps, such as border exclusion or despeckling, are applied.
  3. The system calculates the black pixel weight (total area of non-background pixels) and compares it to the configured detection limits.
  4. If the weight is below the minimum, the page is classified as blank; if above the maximum, as non-blank; if in between, speck analysis is performed.

Supported Pixel Formats

  • The command supports all basic pixel formats: Pixel8bppGrayscale, Pixel24bppBgr, and Pixel1bppIndexed.
  • Images are automatically converted as needed for processing.

Diagnostics

When run in diagnostic mode, Blank Page Detection generates outputs to assist with configuration and troubleshooting:

  • Binarized and preprocessed images showing the effect of thresholding and border exclusion.
  • Diagnostic overlays indicating detected content regions and border size.
  • Log messages reporting pixel counts, detection limits, and classification results.

Use these diagnostics to fine-tune detection parameters and ensure that blank and non-blank pages are correctly identified.

Classification Features

  • This command can generate classification features indicating whether a page is blank or non-blank, which can be used by downstream logic for separation, deletion, or routing.

Practical Guidance

  • Start with default settings and review diagnostic images to assess detection quality.
  • Adjust detection limits and border size to accommodate your document types and scanning conditions.
  • Use speck thresholding for forms with background artifacts or low-level noise.
  • Always validate results visually and iteratively adjust parameters for best performance on your specific document set.

Properties

NameTypeDescription
General
Command Info

See Also

Used By

Notification