How can I prevent a fine-tuned model from generating a lot of END tokens?

ionascumihaig · February 17, 2023, 3:15pm

Hi friends,

I’ve been trying to experiment with fine-tuning an ada model to ouput IaC files, however the model generates random tokens to the max lenght I’ve set in the portal, is there any way to stop this from happening?

ruby_coder · February 18, 2023, 3:26am

Please post an example of the prompt and the expected completion.

Also, please post example lines of your JSONL fine-tuning training file and note the parameters you used when you fine tuned your ada base model.

Please post actual text and not screen shots, because to help you we can easily test and we do not want to type from a screen shot (take too much time and so that will not happen), but copy-and-paste from the actually text (so we can test quickly and accurately).

Thanks @ionascumihaig

ionascumihaig · February 19, 2023, 1:22pm

@ruby_coder Thanks, for taking the time to answer.

Prompt:

This module create a storageAccount resource with apiVersion 2021-01-01.

Expected output:

{‘type’: ‘microsoft.storage/storageaccounts’, ‘apiversion’: ‘2019-04-01’, ‘name’: "[variables(‘diagnostic_storagegroup_name’)]", ‘location’: "[parameters(‘location’)]", ‘sku’: {‘name’: ‘standard_lrs’, ‘tier’: ‘standard’}, ‘kind’: ‘storagev2’, ‘properties’: {‘networkacls’: {‘bypass’: ‘azureservices’, ‘defaultaction’: ‘allow’}, ‘supportshttpstrafficonly’: false, ‘encryption’: {‘services’: {‘file’: {‘enabled’: true}, ‘blob’: {‘enabled’: true}}, ‘keysource’: ‘microsoft.storage’}}} END"}

JSONL file used for fine tuning: train_prepared.jsonl - Google Drive

Topic		Replies	Views
Finetuned GPT-3 Repeating responses API fine-tuning	1	601	July 12, 2023
How to overcome OpenAI fine-tuning training data token limit? API api	5	2326	December 18, 2023
How to stop a fine-tuned model from generating additional tokens? API	2	1469	February 23, 2022
Fine-tuned model in a chatbot gives responses for both the chatbot and the user API	5	2262	March 16, 2023
Struggling with poor performance on fine-tuned davinci model API	15	2588	December 20, 2023

How can I prevent a fine-tuned model from generating a lot of END tokens?

Related topics