Leading whitespace missing in GPT-4 response

Hello.
I have recently encountered an issue with the API and GPT-4, where it does not generate a leading whitespace when appropriate.

A trivial example:

text = "The capital of France is"
messages = [
    {"role": "user", "content": text}
]

result = openai.ChatCompletion.create(
    model="gpt-4",
    messages=messages,
    max_tokens=50,
    top_p=1,
    temperature=0.0,
    stop=[".", "\n"])

print(result)

The output is “Paris” with no leading space.


"model": "gpt-4-0613",
"choices": [
  {
    "index": 0,
    "message": {
      "role": "assistant",
      "content": "Paris"
    },
    "logprobs": null,
    "finish_reason": "stop"
  }
],

Compare this to the response for completion with 3.5-turbo-instruct:

result = openai.Completion.create(
    engine="gpt-3.5-turbo-instruct",
    prompt=text,
    max_tokens=50,
    top_p=1,
    temperature=0.0,
    stop=[".", "\n"])

print(result)

The output is " Paris" with a leading space.


"model": "gpt-3.5-turbo-instruct",
"choices": [
  {
    "text": " Paris",
    "index": 0,
    "logprobs": null,
    "finish_reason": "stop"
  }
],

For my purposes I cannot know whether a space needs to be prepended to the completion. I am performing a beam search using top_logprobs, and the completion may start in the middle of a word (e.g., for the prompt "In France, the capit" the response should be "al is Paris").

Is this due to the nature of chat completions? Will it never generate leading whitespace?

Thanks!

I think we can chalk this up to the difference between the chat/completions endpoint and the completions endpoint.

The key element being chat.

In the chat/completions endpoint, the paradigm is two entities having a conversation. It would be unusual if you texted me “the capital of France is” and I responded " Paris".

Whereas the paradigm in completions is a single, uninterrupted stream of text that is continually appended to. So it makes sense to add the missing space.


Thanks, yeah that makes sense for a chat.
For each chat response, we were trying to do an expansion of a tree of responses using top_logprobs and repeated API calls to the chat endpoint.

So it looks like we will either have to add logic to determine whether a space is needed, or continue to rely on the completions endpoint and pre-GPT-4 models.
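A minimal sketch of that kind of space-insertion logic, assuming a simple heuristic (names here are illustrative, not from any library): insert a space only when the prompt ends with a word character and the completion starts with one. Note this heuristic necessarily fails for genuine mid-word continuations like "capit" + "al is Paris", which is exactly the ambiguity described above.

```python
import re

def join_completion(prompt: str, completion: str) -> str:
    """Heuristically join a prompt and a chat-model completion,
    inserting a space when the model likely omitted leading whitespace.

    Sketch only: assumes a space is needed whenever the prompt ends with
    a word character and the completion begins with one. This breaks for
    true mid-word continuations (e.g. "capit" + "al is Paris"), which is
    why the heuristic cannot fully replace the completions endpoint.
    """
    if re.search(r"\w$", prompt) and re.match(r"\w", completion):
        return prompt + " " + completion
    return prompt + completion

# Word boundary: space is inserted.
print(join_completion("The capital of France is", "Paris"))
# Completion already carries its own leading space: joined as-is.
print(join_completion("Hello,", " world"))
```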

A few of us are actively asking for a GPT-4-turbo-instruct… No word on whether we’ll get it or not.


Ah yeah, that would be great if they made a GPT-4 instruct model!


Simply put, the chat models cannot complete text naturally.

Messages are contained within containers of special tokens.

Additionally, a "prompt" is appended after your input messages: the start of a new message where the AI will write:

<special_token>assistant<special_token>(complete here)

So it is seen as the equivalent of writing at a new line, and starting a new response.
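To illustrate, here is a rough sketch of how the messages are serialized before the model sees them, using ChatML-style delimiters (the real internal format and token names may differ; this is an approximation, not the actual API internals):

```python
def render_chatml(messages):
    """Rough sketch of chat-message serialization.

    Uses ChatML-style <|im_start|>/<|im_end|> delimiters as an
    approximation; the exact internal format is not public.
    """
    out = ""
    for m in messages:
        out += f"<|im_start|>{m['role']}\n{m['content']}<|im_end|>\n"
    # The opening of the assistant turn is appended, so generation
    # begins at the start of a fresh message (i.e., on a new line),
    # not as a continuation of the user's text.
    out += "<|im_start|>assistant\n"
    return out

print(render_chatml([{"role": "user", "content": "The capital of France is"}]))
```

This is why the model sees the user's text as a finished message rather than a string to be continued: its own output starts after a message boundary, where a leading space would be unnatural.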

Completion behavior in these models is a simulation produced by fine-tuning. The logprobs output was also changed so that it no longer returns actual token numbers, making it less useful.

You can have it repeat back your last part ("start at the last line and continue composing"), let it continue writing, and then strip the matching start, if you want a technique that can sometimes partially work.
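The stripping step of that workaround can be sketched like this (a hypothetical helper, assuming the model echoed the tail of the prompt exactly, which it often will not):

```python
def strip_echoed_prefix(tail: str, response: str) -> str:
    """Strip the echoed tail of the prompt from a model response,
    leaving only the continuation.

    Sketch only: assumes an exact echo. Real model output may
    paraphrase or re-tokenize the tail, in which case the response
    is returned unchanged and needs manual handling.
    """
    if response.startswith(tail):
        return response[len(tail):]
    return response

# If the model echoed "In France, the capit" before continuing,
# stripping it recovers the mid-word continuation.
print(strip_echoed_prefix("In France, the capit",
                          "In France, the capital is Paris"))
```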
