The Completions API doesn't *really* return completions

If I send this prompt for completion:

"The following is the list of the descending positive integers starting from 9. The list is complete. 9 8 7 6 5 4 3 2 "

I believe the completion of this ought to be “1”, and nothing more. However, AsyncOpenAI.chat.completions.create is more interested in having a chat:

“The list you’ve provided is a sequence of descending positive integers starting from 9 and decrements by 1 each time. If the list is complete with respect to only including single-digit positive integers 9 down to 2, then the sequence you’ve given is accurate. However, if the sequence is meant to include all positive integers down to 1 in a descending order, then the number 1 is missing at the end of the list. It should be:\n\n9 8 7 6 5 4 3 2 1\n\nEach number in the sequence is one less than the number before it, and this sequence is typically known as a countdown. The integer 1 is the smallest positive integer and would complete the sequence of single-digit positive integers from 9 down to 1.”

Now, to the LLM’s credit, it knows what the list should contain, but I would not describe its response as a completion of my input. Though my knowledge of LLMs is limited, I imagine the most likely predicted next token following my input is “1”, but whatever layers reside between my API call and the transformer are complicating things in undesirable ways for this use case. Is there a way to achieve a sort of “raw” mode? Thanks.

Welcome to the community!

Are you using gpt-3.5-turbo-instruct?

Are you limiting max_tokens? You might also ask for completion in the system prompt.
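
For instance, a rough sketch of both ideas on the chat endpoint (the system wording here is only a guess at something that works; tune it for your case):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        # Hypothetical system wording; adjust as needed.
        {
            "role": "system",
            "content": (
                "You are a text completion engine. Output only the tokens "
                "that continue the user's text, with no explanation."
            ),
        },
        {
            "role": "user",
            "content": (
                "The following is the list of the descending positive "
                "integers starting from 9. The list is complete. "
                "9 8 7 6 5 4 3 2 "
            ),
        },
    ],
    max_tokens=2,  # a hard cap so the model can't ramble even if it tries
)
print(response.choices[0].message.content)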


Hi Paul. Here’s my invocation.

# _remote_client() is a local helper that returns an AsyncOpenAI client,
# so the coroutine must be awaited (or wrapped in asyncio.create_task).
task = await _remote_client().chat.completions.create(
    model="gpt-4",
    messages=[{"role": "user", "content": content}],
    temperature=1,
    max_tokens=4096,
    top_p=1,
    frequency_penalty=0,
    presence_penalty=0,
)

“You might also ask for completion in the system prompt.”

Sorry, would you elaborate? I don’t know what that is.

GPT-4 is unfortunately just a chat model.

The instruct models are the actual completion models. You can play with one here:

https://platform.openai.com/playground/complete

Is that what you meant?
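
For reference, a minimal sketch of calling the legacy completions endpoint from the API (synchronous client shown; the async client works the same way):

from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

response = client.completions.create(
    model="gpt-3.5-turbo-instruct",
    prompt=(
        "The following is the list of the descending positive integers "
        "starting from 9. The list is complete. 9 8 7 6 5 4 3 2 "
    ),
    max_tokens=2,   # the expected completion is only a token or two
    temperature=0,  # near-deterministic output for a task with one right answer
)
print(response.choices[0].text)  # should print something close to "1"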


You need to give it a few shots if you want an exact response like that, at least one or two. Look up few-shot prompting.
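
In chat terms, that means seeding the conversation with worked examples before the real input. A rough sketch (the example pairs here are made up):

from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4",
    messages=[
        # Made-up example exchanges demonstrating the terse behavior we want.
        {"role": "user", "content": "5 4 3 2 "},
        {"role": "assistant", "content": "1"},
        {"role": "user", "content": "12 11 10 9 "},
        {"role": "assistant", "content": "8"},
        # The real input to complete.
        {"role": "user", "content": "9 8 7 6 5 4 3 2 "},
    ],
    max_tokens=4,
)
print(response.choices[0].message.content)  # ideally just "1"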

I see. My example in my original post is, of course, just a contrived toy example. What I’m really after is completion of Python code. I think I’ll be better off trying to programmatically cope with GPT-4’s difficult behavior in exchange for its superior ability.

In my real use case (not the toy example I gave), I gave it 50 examples. xD

Not sure about 50 examples; I usually get the response reduced to a single number sequence with one shot. Also, I usually add a line that says you will fail if you respond with anything except the single number.
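
Roughly like this (my wording; adjust the failure line to taste):

prompt = (
    "Respond with only the single next number in the sequence. "
    "You will fail if you respond with anything except the single number.\n\n"
    "9 8 7 6 5 4 3 2 "
)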


Ah, yes, that’s a Chat Completion model, not Completion (Legacy)…

/v1/chat/completions: all GPT-4o, GPT-4o-mini, GPT-4, and GPT-3.5 Turbo models and their dated releases; the chatgpt-4o-latest dynamic model; and fine-tuned versions of gpt-4o, gpt-4o-mini, gpt-4, and gpt-3.5-turbo.
/v1/completions (Legacy): gpt-3.5-turbo-instruct, babbage-002, davinci-002.

More info here…

https://platform.openai.com/docs/models/model-endpoint-compatibility


Did you try adding “no prose” to your prompts?

Also, try using the user role instead of the system role.

You could also add a label such as

Output:

right at the end of the prompt. Signaling where the output starts can push the model to respond minimally, without explanation (prose).
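
For example (a sketch; the label wording is arbitrary):

content = (
    "Predict the next number in this sequence: 9 8 7 6 5 4 3 2\n"
    "Output:"
)
# Ending the prompt at the label invites the model to continue
# directly with the answer instead of opening with prose.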

Try something like this:

predict the next number in this sequence. put the next number in a <next_number> tag

9, 8, 7, 6, 5, 4, 3, 2
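
The answer can then be pulled out of the tag mechanically. A rough sketch:

import re
from openai import OpenAI

client = OpenAI()

response = client.chat.completions.create(
    model="gpt-4",
    messages=[{
        "role": "user",
        "content": (
            "predict the next number in this sequence. "
            "put the next number in a <next_number> tag\n\n"
            "9, 8, 7, 6, 5, 4, 3, 2"
        ),
    }],
)
reply = response.choices[0].message.content
match = re.search(r"<next_number>\s*(\S+)\s*</next_number>", reply)
if match:
    print(match.group(1))  # expected: 1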
