Help/Tips on Training Gpt-3.5, Best Practice

nickyadl13 · September 7, 2023, 9:06pm

I want to train gpt-3.5 on a couple of books that are pretty long. What would it look like in terms of training format?

Like (prompt: here is page 1 of game of throwns completion: page 1) or would it be like (prompt: here is game of throwns completion: the entire book)

Foxalabs · September 7, 2023, 9:25pm

The best method I know of so far is to leave the “user” prompt blank and simply fill in the “assistant” roll with around 1000 tokens worth of text and do that as many times as is required to cover the entire book.

_j · September 7, 2023, 10:51pm

The AI has a limited amount of text that it can understand at once.

You’ll likely want to investigate an embeddings vector database, also powered by AI. This breaks something like a book into smaller understandable chunks along with a special semantic vector returned by an embeddings engine, and then a vector similarity comparison of the user input against the database can give the answering AI more knowledge by providing it pieces of the literature.

Training is more referring to methods to alter the operation of the AI.

Topic		Replies	Views
How to fine tune gpt3 on raw text without prompt API	1	675	July 27, 2023
I have a book, I want OpenAI to be trained on the book. How can I do it? Community gpt-4 , chatgpt	5	2697	June 13, 2023
How can I fine tune gpt3.5 to be able to read documentation and also books? API	8	2416	December 7, 2023
What's my best option (API) Normal promting or assistant or something else I don't know about? API	1	78	November 25, 2024
Ideal input / output form for Fine Tuning? API fine-tuning	3	1210	October 23, 2023

Help/Tips on Training Gpt-3.5, Best Practice

Related topics