Hi!
I’d like to develop a model capable of generating XML for an internal configuration format from instructions of around 300 characters. General-purpose LLMs such as GPT give middling results, even when I include 5 examples in the prompt.
So I thought I’d have to fine-tune a model to get better performance.
I have a database of about 200 examples, plus a configuration document of about 90 pages. The whole thing represents 150,000 tokens.
Do you think fine-tuning is the best option? If so, do you have any advice on choosing hyperparameters (learning rate, number of epochs, how much validation data to set aside, etc.)?
Should I turn to another solution?
Thanks!
Also consider evals. I’m not suggesting they will solve this on their own, but they’re not on your list.
Thank you, I will also use evals to measure the improvement. It sounds like you think fine-tuning is a good choice for my problem, but before paying for fine-tuning I want to know how to choose my parameters. Also, do I need to provide examples in the prompts of the training dataset, since I am using a chat model?
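For what it’s worth, in the chat fine-tuning format each training record is one JSON object per line (JSONL) containing a `messages` list; the completion you want the model to learn goes in the assistant turn, not as few-shot examples in the prompt. A minimal sketch of one record — the system text, instruction, and XML below are placeholders, not your actual configuration:

```python
import json

# One training record in the chat fine-tuning format (JSONL: one JSON
# object per line). The instruction and XML are hypothetical placeholders.
record = {
    "messages": [
        {"role": "system",
         "content": "You convert short instructions into our internal XML configuration."},
        {"role": "user",
         "content": "Enable logging on node A with a 30s rotation interval."},
        {"role": "assistant",
         "content": '<config><node id="A"><logging enabled="true" rotation="30s"/></node></config>'},
    ]
}

# Serialize to a single line of the .jsonl training file.
line = json.dumps(record)
```

Each of your 200 examples would become one such line, with the same system message repeated so the deployed prompt matches the training distribution.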
FYI
I do not do fine-tuning, so I cannot give any feedback on that part.
I haven’t personally used fine-tuning, but I do have extensive experience with prompt engineering, and I believe it could be a great fit for your use case.
Based on your post, it’s not entirely clear whether you’re already using prompt-engineering techniques or just relying on few-shot prompting.
With well-crafted instructions and chain-of-thought prompting, there’s a good chance you could achieve good results without needing fine-tuning.
I am already using the usual prompt-engineering techniques: well-crafted instructions, chain-of-thought prompting, and few-shot prompting. The results are far better than with a simple prompt, but still not good enough. That’s why I want to fine-tune the model, unless there is something else I can do to improve it.