GPT-4 is kinda slow right now as I bet a lot of people are taking it through its paces.Also, longer prompts require more processing and come back slower.
Text-davinci-003 is a lot more stable and quick now, I’ve noticed…I’m sure GPT-4 will stabilize (and hopefully drop in price some…)
How long is your timeout in your code? What language are you using?
How do we configure for the https://api.openai.com/v1/chat/completions endpoint? I have the gpt-4 invite, but I can only get previous engine versions such as “text-davinci-003” working.
This is what I have in my python web app currently, and it’s working.
If I input “gpt-4” as the engine, I get the error openai.error.InvalidRequestError: This is a chat model and not supported in the v1/completions endpoint. Did you mean to use v1/chat/completions?
@bill.french and @PaulBellow I am using curl call on a web app. Perhaps I know what is wrong. Is the response still a json after you add ’ “stream”: true '?