Response formatting <b>text</b> instead of text

ai-user1 · March 16, 2024, 6:59pm

I am using chatgpt4 and it’s returning good responses except sometimes there is text that it wants to indicate should be in bold but does so with three *s instead of html tags.

It also does the same for titles with three hashes ### title

Is there anyway to get it to either not include these or use the relevant html tags?

PaulBellow · March 16, 2024, 7:06pm

Heya! Welcome to the forum!

Do you have an example of the system/user/assistant prompt(s) you’re using?

Examples of output?

What settings are you using? Temperature, etc.

You can output HTML, but it’s generally not recommended. Much easier to wrap the output of the LLM with HTML tags.

_j · March 16, 2024, 7:19pm

The AI has been trained (and overtrained) on only producing the markdown format that is used by the ChatGPT renderer.

If you are in ChatGPT, then it is working as OpenAI desires, showing you a display format.

If you are on API, and want the AI to write HTML for your web page as its native response format, this is now incredibly difficult to overcome (but older models that developers still maintain access to like gpt-4-0314 can do with ease by simple instruction).

You’ll need to specifically instruct that the desired output is HTML that you’ll be pasting elsewhere, and the AI’s task is to help you write that HTML. It still will probably be enclosed in a markdown code block for “code” formatting in the ChatGPT renderer.

In use of the API, you can just give up and use a markdown-to-html library just as you would if displaying the output to a user.

ai-user1 · March 16, 2024, 7:23pm

This is what it looks like I’m going to have to do. Is there anyway to not render markdown at all and just render plain text?

anon22939549 · March 16, 2024, 7:25pm

Tell it to respond using plain text.

_j · March 16, 2024, 7:28pm

If you are programming the API by a system message, or Assistants by instructions, the most straightforward approach is to simply tell the AI in a separate “response format” section of text you write that there is no markdown renderer available, output would break the appearance, and that markdown is prohibited, including the characters not to produce.

You can additionally penalize the generation of tokens such as asterisks and hash marks with the logit_bias parameter.

In ChatGPT, you have custom instruction, that are placed similarly - just not as believed. You can tell the AI similarly in a “how to act” box that these are verboten.

ai-user1 · March 16, 2024, 7:35pm

    response = client.chat.completions.create(
      model="gpt-4-turbo-preview",
      messages=[
        {
          "role": "user",
          "content": f"write a few parapgraphs that  total approximately 200 words  on how to: {heading} for a blog titled: {blog_post_title}"
        },
      ],
      temperature=1,
      max_tokens=1000,
      top_p=1,
      frequency_penalty=0,
      presence_penalty=0
    )

how can I modify the prompt to do this?

anon22939549 · March 16, 2024, 7:39pm

https://platform.openai.com/docs/api-reference/chat

ai-user1 · March 16, 2024, 7:42pm

Sorry,. I’m completely new to openai and don’t have a clue where to start and what everything means yet. I’m still learning do i add in a role of system and then in the content tell it not to do things like this?

    {"role": "system", "content": "Do not user markdown formatting, * or #"},
    {"role": "user", "content": "my prompt"},

anon22939549 · March 16, 2024, 7:45pm

Read the instructions.
Try a bunch of stuff.
I’m a few hours, if you haven’t figured it out come back and ask again explaining what you’ve tried so far

This isn’t a free write-my-code-for-me service.

Don’t expect people to put more work into answering your question than you put into asking it.

_j · March 16, 2024, 9:21pm

The instructions can be more extensive, “programming your AI”.

Then we just make it hard for the AI to produce those characters at all algorithmically…


    response = client.chat.completions.create(
      model="gpt-4-turbo-preview",
      messages=[
        {
          "role": "system",
          "content": ("You are Blogmaster, an AI that specializes in writing articles.\n\n"
                      "Article content length will be three paragraphs, each 75 words.\n"
                      "Important: Output will be only plain text. Markdown syntax highlight forbidden"
                      )
        },        {
          "role": "user",
          "content": (f"write an article on: {heading} for a blog titled: {blog_post_title}"
                      )
        },
      ],
      temperature=1,
      max_tokens=1000,
      top_p=0.8,
      logit_bias = {
        674: -20,  # " #"
        5062:-20,  # "#"
        9: -20,  # "*"
        353:-20,  # " *"
        334: -20,  # "**"
        3146:-20,  # " **"
        12488: -20,  # "***"
        17601:-20,  # " ***"
        74694:-40, # ```
        },
    )

Also, purposeless AI blog spam stinks.

Diet · March 16, 2024, 11:08pm

You can also try to upbias the starts of html tags if you’d like:

b = 10
10174: b,   #<h
8085: b,    #<p
34277: b,   #<b
20220: b,   #<ul
49747: b,   #<strong
22659: b,   #<i
10328: b,   #<pre
366: b,     #" <"
27: b       #"<"

itsvnk · March 17, 2024, 4:10am

Two choices:

Add to the prompt saying use HTML codes such as instead of your standard markdowns
Or, if you have some kind of a wrapper or if you switch to the API later, use libraries like GitHub - markdown-it/markdown-it: Markdown parser, done right. 100% CommonMark support, extensions, syntax plugins & high speed to convert to HTML (this is the best option, in my personal view)

amrsa · November 3, 2024, 3:40pm

You can simply include in the instruction that the output must be me in html format starting with div
But the problem that using html format will be more expensive as you will consume more token.

itsvnk · December 6, 2024, 4:48am

Not recommended: gets messy, & depending on how one is integrating the results, it paves way for html/javascript injection

VivacityDesign · December 6, 2024, 6:25am

When you get your response you can sanitize it with replace:

const chatGPTResponse = data.choices[0].message.content.trim();

// bold markdown to B tag
var cleanResponse = chatGPTResponse.replace(/\*\*(.*?)\*\*/g, "<b>$1</b>");

// italic markdown to I tag
cleanResponse = cleanResponse.replace(/\*(.*?)\*/g, "<i>$1</i>");

_j · December 6, 2024, 6:38am

You can … damage your math and other AI output with replace.

Math that looks like 3<strong>3</strong>5 = 45 doesn’t make much sense.

Nor does 335 = 45, were the AI to write 3*3*5.

Even ** in this sentence, where whitespace** is next to a word* and *enclosing the word or sequence is not affected by this forum’s proper parsing of markdown.

* (word can even be this line starting with * that would normally be a bullet point, altered by a regex because there are two of them here)
** (whitespace - characters such as space, tab, or linefeed)

Use markdown → HTML parse libraries specific to the purpose, properly identifying valid commonmark containers.

VivacityDesign · December 6, 2024, 8:32am

Hi Jay, thank you for your objection that is correct for the general principle… in my answer i sticked to the examples given in the question, and wanted to illustrate the concept of string sanitization… obviously to adapt the concept to all cases one must study the various posisble cases and modify the rules or apply new (subsequent) ones…

i.e., this simple change:

.replace(/(?<!\w)\(.?)\*(?!\w)/g, ‘$1’)
.replace(/(?<!\w)(.?)*(?!\w)/g, ‘$1’);

should manage the math problem very well and also allow for checking multicharacters / multispaces text like

text in italics → text in italics
text in bold → text in bold

I admit i cannot test it right now but i am confident it works for most cases.

| _j
December 6 |

| - |

VivacityDesign:

you can sanitize it with replace

You can … damage your math and other AI output with replace.

Math that looks like 335 = 45 doesn’t make much sense.

Nor does 335 = 45, were the AI to write 335.

Even ** in this sentence, where whitespace** is next to a word* and *enclosing the word or sequence is not affected by this forum’s proper parsing of markdown.

(word can even be this line starting with * that would normally be a bullet point, altered by a regex because there are two of them here)
** (whitespace - characters such as space, tab, or linefeed)

Use markdown → HTML parse libraries specific to the purpose, properly identifying valid commonmark containers.

Topic		Replies	Views
Can't seem to eliminate markdown format API gpt-4 , gpt-4-vision	15	3541	February 21, 2025
GPT-4-o returns answers with Markdown Prompting gpt-4 , output-markdown , gpt-4o	15	10061	July 12, 2024
Formatting plain text to markdown Prompting api	2	10071	January 21, 2024
Can't get the API to convert to HTML for my front end Prompting gpt-4 , output-html , output-markdown	19	8157	November 25, 2023
Help me fix my prompt that interprets text as instructions Prompting api	10	411	January 1, 2025

Response formatting <b>text</b> instead of ***text***

Related topics

Response formatting <b>text</b> instead of text