How to Fine-Tune a Model with Book Data for a Chatbot?

itjimkr · July 29, 2024, 2:20am

Hello OpenAI Community,

I have a question about fine-tuning. I have a large volume of book data that I would like to use for fine-tuning. However, I do not have any question-answer format data, and given the vast amount of content, I am unsure how to create such question-answer pairs from it.

My goal is to train the model on this book data and build a chatbot that can engage in conversations based on this information. I am looking for advice on the best approach to achieve this.

Thank you for your help.

curt.kennedy · July 29, 2024, 5:29am

Look into RAG (retrieval augmented generation). On this forum and elsewhere. Also look at embeddings.

You basically create a mini search engine on your book (via embeddings). And feed this to the prompt for the AI to respond to the user.

Fine tunes here are not going to soak in much knowledge. They can soak in tone, but lack specific content.

We’ve all been in your shoes … how do I convert a book into prompt/completion pairs?

For tone, their are various posts here on how to do it. But basically you create prompt/completion pairs on the book by converting each passage into a neutral passage. This is the “prompt’ leg. The completion is the original passage from the book. This fine-tune will capture the tone of your book, the writing style, etc.

itjimkr · July 29, 2024, 5:53am

Thank you very much for your help. I will continue to study and learn more about this topic.

Topic		Replies	Views
Fine-Tuning with Non-Prompt/Completion Data: Seeking Advice for Direct Text-Based Training? API gpt-4 , chatgpt , fine-tuning , api	3	212	August 23, 2024
How to fine tune gpt3 on raw text without prompt API	1	652	July 27, 2023
Data preparation for finetuning API fine-tuning	2	141	December 2, 2024
Best way to create a chatbot using pre-trained models Community fine-tuning	3	3083	April 16, 2024
Seeking Guidance on Fine-Tuning GPT-3.5 Turbo for a Biography-Based AI Chatbot API api	0	524	December 26, 2023

How to Fine-Tune a Model with Book Data for a Chatbot?

Related topics