I’m facing a similar problem: every API call takes more than 10 seconds. I have tried different values for temperature, but no luck.
I use the following config:
temperature: 0.66,
max_tokens: 2147,
top_p: 1,
frequency_penalty: 0,
presence_penalty: 0,
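
For anyone trying to reproduce this, here is a minimal sketch of one way to time a single call with the settings above. `send_request` is a hypothetical placeholder, not a real client function; it would need to be swapped for whatever API call is actually being made, and the sleep inside it only stands in for network time.

```python
import time

# Settings copied from the post above.
params = {
    "temperature": 0.66,
    "max_tokens": 2147,
    "top_p": 1,
    "frequency_penalty": 0,
    "presence_penalty": 0,
}

def send_request(**kwargs):
    """Hypothetical placeholder: replace the body with the real API call."""
    time.sleep(0.01)  # stands in for the network round trip
    return "stub response"

def timed_call(fn, **kwargs):
    """Call fn once and return (result, elapsed wall-clock seconds)."""
    start = time.perf_counter()
    result = fn(**kwargs)
    return result, time.perf_counter() - start

result, elapsed = timed_call(send_request, **params)
print(f"latency: {elapsed:.2f} s")
```

Timing a few calls this way while changing one setting at a time would show whether any of these values actually affects the latency.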