Grooper Help - Version 25.0
25.0.0017 2,127
  • Overview
  • Help Status

Data Field Container - Build Fine Tuning File

Data Field Container Command Grooper.GPT

Generates a fine-tuning file from the documents in one or more reference batches.

Remarks

The Build Fine Tuning File command is used to create a training data file for fine-tuning a large language model (LLM) using Grooper's AI Extract functionality. This command collects extracted and human-corrected data from selected Batches and formats it into a file suitable for use with OpenAI or other LLM providers that support fine-tuning.

When to Use

  • When you want to improve the accuracy of AI Extract by training a custom LLM model on your own document samples.
  • After running extraction and performing human review/correction on a set of documents to ensure the data represents the ideal output.

How It Works

  1. Select the Data Field Container (such as a Data Model, Data Section, or Data Table) that uses AI Extract.
  2. Choose the Data Fill Method to generate fine-tuning data for.
  3. Select one or more Batches containing documents that have been extracted and corrected.
  4. Optionally, specify whether to include the schema in the fine-tuning data.
  5. Execute the command. A new fine-tuning file will be generated and saved to the "Advanced" tab of the Data Element.

You can run this command multiple times on different batches to create multiple fine-tuning files. Once you have generated the necessary files, use the Start Fine Tuning Job command to train a custom OpenAI model.

> Note: The quality of your fine-tuning results depends on the accuracy and representativeness of the data in your selected batches. Be sure to review and correct extracted data before generating fine-tuning files.

Properties

NameTypeDescription

See Also

Notification