Prompting GPT3.5 for NER data labeling

jr.2509 · January 24, 2024, 8:31pm

Ok. If you don’t have a closed-ended list of terms that is fine, too. The model should still pick up the overall pattern.

You should include examples of NSFW terms in your training set for the model to understand how to treat these.

In terms of JSON, yes you can instruct the model via fine-tuning to respond in a desired JSON format. Again here, tried and tested and works very well. I agree that in a non-finetuned setting, GPT-4 is inherently better at this but you can definitely get consistent JSON results with a finetuned GPT 3.5.

Finally, ensure your system prompt is specific. If you are for instance worried about the volume of words for a given category, then simply include restrictions in your system prompt in this regard (i.e. no more than X).

Topic		Replies	Views
How to improve a fine-tune classifier? Prompting	10	1494	August 15, 2022
Is it possible finetune with unlabeled data and then labeled data? API fine-tuning	5	1154	March 18, 2024
Having trouble to make AI avoid certain topics Prompting	13	4223	April 17, 2022
Struggling with fine-tuning GPT for generating JSON API fine-tuning , fine-tuning-problems	1	418	July 9, 2024
Fine Tuning Help defining Prompt/Completion API	18	2567	January 14, 2026

Prompting GPT3.5 for NER data labeling

Related topics