I’m developing an application for non-streaming conversations with assistants via API, primarily utilizing files stored in a vector database. While everything seems to work fine initially, I’ve noticed an inconsistency in response behavior.
The first response to any question asked via the API is almost identical to the response generated in the Playground—exactly as expected. However, the issue arises with subsequent questions. The answers, though not incorrect, differ slightly between the API and Playground. Both responses cite the same source file, but the phrasing or details in the API-generated answer vary.
Interestingly, when I ask the second question in a new thread (both in the Playground and via API), the initial response matches perfectly again, just like the first question did.
I’ve ensured that all parameters—temperature, top-p, model, etc.—are identical for both the Playground and the API requests. Given this, I’m wondering if this could be a bug in v2 of the API. Many users have noted that the API should behave similarly, if not identically, to the Playground.
Has anyone else experienced this issue?