How to fine-tune gpt-3.5-turbo to give it new knowledge?

Welcome to the community, @brandojazz!

I agree with @elmstedt.

Fine-tuning is recommended when you want to save on prompt tokens across a high volume of calls for specific use cases, or to set the model's behavior.

For knowledge augmentation and retrieval, embeddings are the go-to approach.
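As a rough sketch of the first step of that approach, you would split your documents or code files into chunks and embed each chunk. The chunk size, overlap, and embedding model name below are illustrative assumptions, not recommended values; check the current docs for the right model:

```python
# Minimal sketch: split a source file into overlapping chunks for embedding.
# Chunk size and overlap here are illustrative assumptions.

def chunk_text(text: str, chunk_size: int = 500, overlap: int = 50) -> list[str]:
    """Split text into chunks of roughly chunk_size characters with overlap."""
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

# Each chunk would then be embedded, e.g. with the OpenAI embeddings
# endpoint (model name is an assumption; check the current docs):
#
#   from openai import OpenAI
#   client = OpenAI()
#   resp = client.embeddings.create(model="text-embedding-ada-002",
#                                   input=chunks)
#   vectors = [d.embedding for d in resp.data]

source = "x" * 1200
print([len(c) for c in chunk_text(source)])  # → [500, 500, 300]
```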

Here’s the documentation on embeddings.

Currently, GPT models have a finite context length, which limits the number of tokens (prompt + completion) they can handle.

To overcome this, you can pass an outline of your repository to the model and give the model access to "see" the code, similar to how Advanced Data Analysis works in ChatGPT, or how open-interpreter does locally.

Once the model selects the file(s) to use for retrieval-augmented generation (RAG), you can embed their contents and search for semantically similar chunks to pass as context.
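The retrieval step can be sketched with plain cosine similarity. The vectors below are tiny stand-ins for real embeddings (which you would obtain from the embeddings endpoint, typically with ~1536 dimensions), so the chunk names and numbers are illustrative only:

```python
import math

def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two embedding vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

def top_k(query_vec, chunk_vecs, chunks, k=2):
    """Return the k chunks whose embeddings are most similar to the query."""
    scored = sorted(zip(chunks, chunk_vecs),
                    key=lambda cv: cosine_similarity(query_vec, cv[1]),
                    reverse=True)
    return [chunk for chunk, _ in scored[:k]]

# Tiny stand-in vectors; real embeddings are much higher-dimensional.
chunks = ["def parse(...)", "README intro", "def parse_args(...)"]
vecs = [[0.9, 0.1], [0.1, 0.9], [0.8, 0.2]]
query = [1.0, 0.0]  # the embedded user question

print(top_k(query, vecs, chunks))  # → ['def parse(...)', 'def parse_args(...)']
```

The returned chunks are then prepended to the prompt as context for the completion call.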
