Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Document Instance

Element Container Instance Grooper.Core

Represents the entire content of a document, and serves as the root of the Data Element Instance hierarchies generated by the Extract activity.

Remarks

The Document Instance class is the root data object for a document’s extracted data in Grooper. It encapsulates all content, structure, and metadata for a single document, providing the foundation for data extraction, validation, review, and export.

Role in Grooper

Document Instances are created automatically when a document is loaded or processed in Grooper, typically as the result of the Extract activity on a Batch Folder. Each Document Instance forms the root of a hierarchical tree of Data Instance objects, representing all extracted fields, tables, and sections for the document. This hierarchy mirrors the Data Model defined for the document’s Content Type.

Document Instances are essential for organizing, storing, and presenting all extracted or user-entered data for a document. They enable Grooper to support complex document schemas, multi-level data models, and advanced review and validation workflows.

Usage and Configuration

  • Document Instances are managed automatically by Grooper and are not created or configured directly by end users.
  • When a document is classified and extraction is performed, a Document Instance is created and populated with child instances for each Field Instance, Table Instance, and Section Instance defined in the Data Model.
  • The Document Instance is visible in the Data Review UI, where users can review, validate, and edit all extracted data for the document.
  • Advanced users may interact with Document Instances via expressions, custom code, or API integrations for automation, validation, or custom export scenarios.

Hierarchy and Structure

  • The Document Instance is always the root of the data instance hierarchy for a document.
  • Its children represent the top-level data elements (fields, tables, sections) as defined by the Data Model.
  • Each child may have further descendants, supporting arbitrarily complex and nested document schemas.
  • This structure enables Grooper to represent and process documents of any complexity, from simple forms to multi-page, multi-section documents with repeating tables and nested data.

Extraction, Validation, and Review

  • Document Instances are created and populated during the Extract activity, using the extraction logic defined in the Data Model.
  • Validation is performed on the Document Instance and all descendants, ensuring that all required data is present and correct.
  • The Document Instance tracks validation status, error counts, and change tracking for the entire document.
  • In the Data Review UI, the Document Instance provides a single point of access for reviewing and editing all data for the document.

Integration with Other Grooper Features

  • Document Instances are used throughout Grooper for data validation, export, reporting, and workflow automation.
  • They support advanced features such as sticky fields, change tracking, error reporting, and sibling/parent document relationships.
  • Document Instances provide access to document-level metadata, OCR results, page images, and Layout Data.
  • They are the primary data structure for transferring extracted values between activities, exporting to external systems, or presenting data for user review.

Diagnostics and Audit Trails

  • Document Instances participate in diagnostic logging and audit trails for extraction, validation, and review activities.
  • Diagnostic artifacts may include extraction logs, validation error reports, and audit trails of user edits or automated changes.
  • The Document Instance tracks error counts and invalid elements, supporting quality control and exception handling workflows.

Properties

NameTypeDescription
General
Document Reference

See Also

Notification