Hello and thanks in advance of reviewing this.
I have an odd situation with a fine-tuned model and I am not sure if it is fine tuning related or whether is can be addressed with Prompt Engineering.
First, the symptoms.
I have three Test request threads:
- divinci-002
- divinci-003
- divinci-002-Fine-tuned (very small training data set: 250 Q&A pairs)
I asked the same series of questions to all three to test the fine-tuned version.
divinci-002*
Q: What if my insurance won’t cover the procedure?
A: If your insurance plan does not cover the procedure, we offer a variety of financing options to make the procedure more affordable. For more information, please call us at 1-800-XXX-XXXX.
divinci-003*
Q: What if my insurance won’t cover the procedure?
A: If your insurance does not cover the procedure, we offer a variety of payment options to make the procedure more affordable. Please contact our office for more information about our payment plans.
– both of these are very acceptable (except for the 002 providing a false phone number)
But then the fine-tuned model:
divinci-003-fine-tuned*
Q: What if my insurance won’t cover the procedure?
A: *If your insurance provider is not on our list, you can reach out to your insurer and ask if the procedure is covered or if you need a referral. I am also sending you a link to our patient education website with more information about the procedure. It has a Find A Doctor feature that allows you to search for qualified doctors in your area. It will be helpful to enter your zip code into the Find A Doctor feature and click “Search” for a complete listing of Lapiplasty doctors to display for your consideration. I do not know what other questions you may have about the procedure, so I invite you to take a look at the information on our patient education website as well as reaching out to one of our doctors for personal consultation.
RAW Paste Data
Patient Hotline Dialogue [This dialogue is from a phone call with an AI assistant representing Medical Acess Center, which handles patient calls regarding the Procedure.] This is not an actual doctor call - for entertainment purposes only*.
Many things to be concerned about in the third response:
- The “I am sending you a link…” voice and language will not work with the AI Chat. I do not know how to prompt this out of any responses.
- The RAW Paste Data is not found in any training data and has never been part of any pre-FT tests responses. Only with this fine-tuned model. And it is consistent.
- The Lapiplasty Patient Hotline Dialogue [This dialogue is from a phone call with an AI assistant representing Treace Medical Acess Center, which handles patient calls regarding the Lapiplasty Procedure.] This is not an actual doctor call - for entertainment purposes only language clearly appears to come from some other archived call center training data. Our training dat did not include this at all, but in several tests (even some responses later in this thread) this appears.
Not sure if these are all issues with the fine-tuning process, the fine-tuning data or of they can be address with prompt modification.
But it is concerning when the fine tuned model produces less accurate and less concise responses than the base 002 or 003 models.
Any guidance is much appreciated. I am sure this cannot be an isolated scenario.
thanks in advance.