Advice & Tips for finetuning

zafarr · October 17, 2024, 5:54am

In the training data for finetuning GPT for a use case where I want the bot to solve law case (provided by user) based on provided relevant laws (will be provided within the prompt, I am getting them using RAG), do I need to provide context (relevant laws) within the input (of each training data I/O) or just the user question (law case) would be enough?

And how many examples do I need do get it working properly?

And any tips about customising the finetuning parameters for my use-case?

jr.2509 · October 17, 2024, 6:30am

Hi @zafarr -

Can you take a step back and explain what specifically you are looking to achieve with your fine-tuning project?

To your specific questions: yes, you would need to provide the context for each example.

How many examples really depends. In some cases 50-100 examples may be enough to get a decent fine-tuned model. In other cases, you may need to create 1,000+ examples for it to work properly. It’s common practice to start small and see where this takes you and then add more examples to further optimize it.

zafarr · October 17, 2024, 6:33am

I want to finetune so it can solve law cases, in a specific format and I assume that finetuning it will also help it understand how exactly to go about solving a case, I mean it probably will understand the thought process behind solving a law case by finetuning).

jr.2509 · October 17, 2024, 6:38am

Ok, got it. If the focus is on the how of solving a case, then a fine-tuned model should work.

I’d say the number of examples should then take into consideration the diversity of cases. So say the cases and the approach to solving them is fairly similar, then you should be able to get by with a smaller set of examples (e.g. 30-50 examples to start with). In contrast, if you have a larger diversity of cases, then you’d need to opt for more training examples to have an adequate representation for each type of approach in your data set in order for the model to pick up the pattern.

zafarr · October 17, 2024, 6:42am

Understood, thanks, and for larger diversity cases, do I need to ensure that number of training examples for each type of case is very similar, or would it otherwise lean towards solving in the way for which we have more examples.

jr.2509 · October 17, 2024, 6:44am

From my own experience, I’d try to keep the number somewhat similar. Like you wrote, otherwise you may run into the risk that it focuses too much on the most dominantly represented solution approach.

ds2 · October 28, 2024, 1:16pm

@zafarr - may I ask how you plan to do it? I mean, how are you gone build that model and train with the cases? I would like to do the same for the company I work for. Thanks!

zafarr · October 31, 2024, 7:42am

If your company can provide you with a proper decent sized dataset of input and output that’s the best-case scenario, then you might not even need LLMs. As for LLMs imo the real issue is context issue that causes hallucination, which I don’t think finetuning can fix itself this issue, so I haven’t figured out what exactly to do to make it work with LLMs. But if your questions are relatively straight forward and won’t ever need to use 15k-20k tokens then LLMs would be fine, just use RAG for retrieving relevant docs and you’re good to go, for getting output in desired and for further improvement you can fine-tune LLM with 50-100 examples.

Topic		Replies	Views
What is the best way of getting OpenAI API to respond with more specific & statistical responses related to Financial Markets? API fine-tuning-vs-rag	5	64	November 24, 2024
Is fine-tuning the way to go to generate legal opinions (law technical reports)? API	10	4094	December 9, 2023
Fine-Tuning with help of massive amount of documents API	25	6706	July 20, 2024
RAD embedded retrieval + fine tuning OR retrieval assistant + fine tuning. What will be better to create a gpt to help lawyers draft arguments Community fine-tuning , plugins , rag , assistants	1	945	January 12, 2024
Fine tuning - how exactly does it work? API	6	2353	December 23, 2023

Advice & Tips for finetuning

Related topics