Fine-tuning - How to deal with bullets in the answer text?

dg · September 7, 2023, 4:19pm

What is the best way to implement enumerations in the content area when fine-tuning?

As a comma-separated list,
preceded by a bullet?
on a separate line?

Is there a “best practice” for this?

_j · September 7, 2023, 6:26pm

ChatGPT and thus the underlying gpt-3.5-turbo are pretrained on producing markdown format.

Markdown is relatively simple, a minus sign (-) and a space, and you’ll begin a bullet unordered list. A number, a period, and a space, and that instead is a numbered list. It is the same format this forum uses, which you can test in the reply box.

The basic list structure can also be displayed as plain text without need for a UI interpreter (while bold text with two asterisks surrounding it must be rendered).

You can certainly demonstrate other outputs in your fine-tune examples, even training the AI to write HTML, but that’s a larger hurdle.

sergeliatko · September 7, 2023, 10:54pm

Personally I find it performs better if:

numbered list in instructions,
bullet list in answers when items are long or have their own syntax
comma separated if list and items are short in answers
comma separated list if answer to summarizing to a list of subjects/keywords
bullet list of summarizing to longer items

But then again, it’s just an opinion

Topic		Replies	Views
Formatted response from chatgpt Community gpt-4 , chatgpt , api	3	1780	December 22, 2023
How to stop numbered lists in output Prompting gpt-4	8	1850	July 3, 2023
Create bullets automatically Prompting	2	416	May 1, 2022
Prompt to get a specifc number of bullet points Prompting chatgpt	5	1183	August 11, 2023
Reading and summarizing technical documents Prompting gpt-4	4	808	November 8, 2023

Fine-tuning - How to deal with bullets in the answer text?

Related Topics