Fine-tuning on documents - Unsure of general process

overmyhead · May 18, 2023, 4:41pm

I have a series of specialized documents I’d like to fine-tune on. I want to pull the same information from each document, but it is embedded within the document itself.

I've already retrieved the relevant sections of each document via vector search, but I'm trying to figure out how to take those sections and work them into the training data prompts.

I’m starting each prompt with: “Given the following information: \n?” and then providing the completion.

Is this the right process for fine-tuning? For each document, I might have 30 questions. Should I just be creating a single training file with the 30 prompts and completions for each document?
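
To make the question concrete, here is a minimal sketch of the single-file approach I'm describing, assuming the legacy prompt/completion JSONL format (one JSON object with "prompt" and "completion" keys per line). The document structure, the helper names, the prompt template, the separator, and the stop token are all my own placeholders, not anything prescribed by the API.

```python
import json

# Assumed conventions (placeholders): a fixed separator ending every prompt
# and a fixed stop sequence ending every completion.
SEPARATOR = "\n\n###\n\n"
STOP = " END"


def build_example(section: str, question: str, answer: str) -> dict:
    """Build one training record from a retrieved section and a Q/A pair."""
    prompt = f"Given the following information: {section}\n{question}{SEPARATOR}"
    # Completion starts with a space and ends with the stop sequence.
    return {"prompt": prompt, "completion": " " + answer + STOP}


def build_records(documents: list) -> list:
    """Flatten every document's question set into one list of records."""
    records = []
    for doc in documents:
        for qa in doc["qa_pairs"]:
            records.append(
                build_example(qa["section"], qa["question"], qa["answer"])
            )
    return records


def to_jsonl(records: list) -> str:
    """Serialize records as JSONL: one JSON object per line."""
    return "".join(json.dumps(r, ensure_ascii=False) + "\n" for r in records)


# Hypothetical sample data: two documents, three questions in total.
documents = [
    {
        "qa_pairs": [
            {
                "section": "The lease term is 24 months.",
                "question": "What is the lease term?",
                "answer": "24 months",
            },
            {
                "section": "Rent is due on the first of each month.",
                "question": "When is rent due?",
                "answer": "The first of each month",
            },
        ]
    },
    {
        "qa_pairs": [
            {
                "section": "The lease term is 12 months.",
                "question": "What is the lease term?",
                "answer": "12 months",
            }
        ]
    },
]

records = build_records(documents)
jsonl_text = to_jsonl(records)
# jsonl_text can then be written to a single training file for all documents.
```

With 30 questions per document, this would give one file containing 30 lines per document, each pairing a retrieved section and question with its expected answer.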
