Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Split Pages

Code Activity Grooper.Activities

Splits a multi-page document into individual Batch Pages for downstream processing.

Remarks

The Split Pages activity is designed to convert multi-page documents—such as PDFs and TIFFs—into individual Batch Pages within a Batch Folder. This is a foundational step in most Grooper workflows, enabling page-level processing, parallelization, and granular document management.

Supported File Types

  • PDF Documents:
    Handles both image-based and text-based PDFs. Supports advanced options for image bursting, rendering, and resource extraction. Image-based pages can be extracted as original images, while text-based or mixed-content pages can be split as single-page PDFs.
  • TIFF Images:
    Multi-page TIFFs are split into single-page TIFFs without decompression or recompression, ensuring no image quality loss.
  • Other Image Formats:
    Single-page images (JPEG, PNG, etc.) are added as individual pages. Multi-frame images are also supported.
  • Digital Documents (Word, Excel, etc.):
    These must be converted to PDF prior to splitting, typically using the Render activity or format-specific conversion commands.

How It Works

  • Operates on Batch Folders containing supported attachments.
  • For each input file, creates a Batch Page for every page or image frame.
  • Offers options to filter which pages are extracted, limit the maximum number of pages, and control overwrite behavior.
  • PDF-specific options allow for fine-tuned extraction, including image bursting, rendering, compression, and bookmark replication.
  • Optionally deletes the original attachment after splitting to conserve storage.

Performance and Best Practices

  • Splitting large documents can be resource-intensive, especially with image bursting or rendering enabled. For best performance, consider disabling these options during the split and using the Batch Page - Rasterize command later for parallel image extraction.
  • TIFF splitting is lossless and fast, as no image recompression occurs.
  • PDF splitting can be affected by document structure (e.g., shared resource dictionaries). Use the appropriate extraction mode to avoid file bloat.
  • When working with digital documents, always ensure a PDF version is available before splitting.

Integration with Other Activities

Error Handling and Diagnostics

  • Files exceeding the configured page limit will generate errors.
  • PDF password protection and malformed files are detected and reported.
  • Optionally flag documents when recoverable PDF issues are encountered.

Example

The following example demonstrates how Split Pages converts multi-page PDFs into individual Batch Pages. In this example, Split Pages will run on each item at folder level 1.

Before Split Pages:

 Batch
  ├─📁 Document 1 (Attachment: Invoice.pdf - 5 pages)
  └─📁 Document 2 (Attachment: May Invoice.pdf - 3 pages) 
 

After Split Pages:

 Batch
  ├─📁 Document 1
  │ ├─📄 Page 1
  │ ├─📄 Page 2
  │ ├─📄 Page 3
  │ ├─📄 Page 4
  │ └─📄 Page 5
  └─📁 Document 1
    ├─📄 Page 1
    ├─📄 Page 2
    └─📄 Page 3
 

In this example, the original 5-page PDF is split into five individual Batch Pages under the same Batch Folder.

For more information, see the documentation for Batch Folder, Batch Page, and Render activities.

Properties

NameTypeDescription
General
PDF Options
Processing Options

See Also

Used By

Notification