How to Reduce Fine-Tuning Error by 37%

chrismauck10 · May 2, 2023, 4:47pm

Hello everyone!

I spent some time playing around with the OpenAI fine-tuning API and I discovered that noisy data still has drastic effects even on powerful LLMs like Davinci.

I took some time to write about how to use data-centric AI in this recently published article in KDNuggets so that you can improve your models too. The results I found were quite eye-opening.

Let me know what you think!

curt.kennedy · May 2, 2023, 5:11pm

I like it! Auto-detect and remove (or correct) outliers in your training data.

But why did you embed with davinci-001 and not the newer ada-002? Was there a reason for that? I wonder if you would get better results, since ada-002 is supposed to be better and has far fewer dimensions than davinci-001.
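For anyone wondering what auto-detecting outliers from embeddings could look like, here is a minimal sketch. It is not necessarily the method used in the article: it is a simple nearest-neighbor heuristic that flags a training example when its label disagrees with the majority label of its closest neighbors in embedding space. The function name `flag_suspect_labels`, the parameter `k`, and the toy 2-D "embeddings" are all made up for illustration; real embeddings would come from whichever embedding model you choose.

```python
import math
from collections import Counter


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def flag_suspect_labels(embeddings, labels, k=3):
    """Return indices whose label disagrees with the majority label
    of their k most similar neighbors (hypothetical helper)."""
    suspects = []
    for i, (vec, label) in enumerate(zip(embeddings, labels)):
        others = [j for j in range(len(labels)) if j != i]
        # Sort the other examples by similarity to example i, most similar first.
        nearest = sorted(
            others, key=lambda j: cosine(vec, embeddings[j]), reverse=True
        )[:k]
        majority, _ = Counter(labels[j] for j in nearest).most_common(1)[0]
        if majority != label:
            suspects.append(i)
    return suspects


# Toy data (assumed, not from the article): two clusters, with example 3
# sitting in the "pos" cluster but labeled "neg".
embeddings = [
    (1.0, 0.1), (0.9, 0.2), (1.0, 0.0), (0.95, 0.15),
    (0.1, 1.0), (0.2, 0.9), (0.0, 1.0), (0.15, 0.95),
]
labels = ["pos", "pos", "pos", "neg", "neg", "neg", "neg", "neg"]

print(flag_suspect_labels(embeddings, labels, k=3))  # -> [3]
```

Flagged examples are only candidates. You would still review them and either fix the label or drop the row before fine-tuning again.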

PaulBellow · May 2, 2023, 11:12pm

Welcome to the community!

Thanks for sharing your results with us.

Cleaning datasets is going to be needed even more in the months/years ahead.
