I don’t work for OpenAI and didn’t program the models; all we can do is extract evidence from the outside.
The randomness is evidently not reset to a fixed state between API calls: doing so would defeat the purpose of sampling for diverse answers and would just guarantee the same low-perplexity response every time.
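To see why, here is a toy sketch in plain Python (not the API’s actual sampler; the vocabulary and weights are made up) showing what resetting the RNG to a fixed state per call would do:

```python
import random

VOCAB = ["yes", "no", "maybe"]
WEIGHTS = [0.40, 0.39, 0.21]  # hypothetical distribution with near-tied top-2

def sample_reply(length=5, reset_seed=None):
    """Simulate one API call: draw `length` tokens from the distribution.
    If reset_seed is given, the RNG is reset to that fixed state first."""
    rng = random.Random(reset_seed) if reset_seed is not None else random
    return [rng.choices(VOCAB, weights=WEIGHTS)[0] for _ in range(length)]

# Resetting to a fixed state per call -> identical output every time,
# even though two candidates have almost equal probability.
a = sample_reply(reset_seed=1234)
b = sample_reply(reset_seed=1234)
print(a == b)  # True

# Without the reset, the RNG state keeps advancing between calls,
# so repeated calls can (and in practice do) produce different tokens.
```

With the per-call reset, sampling collapses to a deterministic sequence, which is exactly the behavior the API does not exhibit.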
As one example, you can replicate this prompt and the next token it produces (the 46th context element) to probe a case where the two top probabilities are almost identical:
More:
The latter gives you sample chat-endpoint code and gpt-4 results, and we also have the newer gpt-3.5-turbo-instruct completion model, which takes raw input and returns logprobs to experiment with.
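A minimal sketch of such a completions request, asking for the top token logprobs; the prompt string here is only a placeholder for the probe prompt above, and the network call is left commented out since it needs an API key:

```python
import json
import os  # used in the commented-out request below

# Request body for the legacy completions endpoint.
payload = {
    "model": "gpt-3.5-turbo-instruct",
    "prompt": "Your probe prompt here",  # substitute the probe prompt
    "max_tokens": 1,       # we only care about the next token
    "temperature": 0,      # greedy pick; ties expose the near-identical case
    "logprobs": 5,         # return the top-5 token logprobs per position
}

print(json.dumps(payload, indent=2))

# To actually send it:
# import requests
# r = requests.post(
#     "https://api.openai.com/v1/completions",
#     headers={"Authorization": f"Bearer {os.environ['OPENAI_API_KEY']}"},
#     json=payload,
# )
# print(r.json()["choices"][0]["logprobs"]["top_logprobs"][0])
```

Running the same payload repeatedly at temperature 0 and inspecting `top_logprobs` is one way to catch two candidates whose probabilities are nearly identical.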