Looks like openai v2.22.0 lib now has websockets implementation for the Responses API, which should significantly improve the time-to-first-token, Release v2.22.0 · openai/openai-python · GitHub. On Codex-Spark it was 50% improvement so let’s see.
5 Likes
Seems to be good for tool calling.
4 Likes