Fine tune models with URL data

matos.ray · July 26, 2024, 2:59pm

Im using chat transcripts to fine tune a model that will be using in an assistant. I left the assistant instructions alone. The chat transcript sometimes return URLS to resources, before converting that data into prompt/completions i strip all HMTL data including the URLs. I figured the instructions in the assistant along with the files uploaded would readd them. However when testing in playground, the answers are much better, but it does not hyperlink the resources, it just called it by name.

What is the best way to fine tune a model to use in an assistant? The assistant without the Finetune model provides the correct answer 90% of the time, i was hoping a fine tine model would make that 100% and now the assistant would be an expert.

jaffar.queries · July 27, 2024, 7:38pm

To fine-tune your model to include hyperlinks:

Preserve URLs in Data: Keep URLs in your chat transcripts; don’t strip them out.

Format Prompt/Completion Pairs: Ensure the completions include URLs as hyperlinks.

{
  "prompt": "User: Recommend resources on machine learning.\nAssistant:",
  "completion": "Sure! Here are a few:\n- [Machine Learning Mastery](https://machinelearningmastery.com)\n- [Coursera Machine Learning Course](https://www.coursera.org/learn/machine-learning)\n- [Deep Learning](https://youtubethumbnaildownloaderonline.com)"
}

Fine-Tune the Model: Use this correctly formatted data to fine-tune your model.
Test the Model: Verify that the model generates responses with properly formatted hyperlinks.

Topic		Replies	Views
How to correctly fine tune my own model? API	3	2473	January 21, 2023
Finetuning For An Assistant API api	3	1180	January 19, 2024
Finetuning ChatGPT3.5 to understand the content of all URLs of my website GPT builders	2	879	December 17, 2023
How to get specific response type with File Search API gpt-4	6	755	April 22, 2024
Finetune to provide a link in the response API api	1	224	October 19, 2023

Fine tune models with URL data

Related Topics