It’ll never be as fast as chat completions, because it has a context/history of messages to process for every response.
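
To make that concrete, here is a rough sketch of how the input grows when every response has to re-read the stored history. It is only an illustration under assumptions: the token counts are made up, and `tokens_read_per_turn` is a hypothetical helper, not part of any API.

```python
from itertools import accumulate


def tokens_read_per_turn(message_tokens):
    """Return how many tokens must be read before each reply,
    assuming the full message history is processed every time."""
    return list(accumulate(message_tokens))


# Hypothetical token counts for five successive messages in one conversation.
history = [50, 120, 80, 200, 150]

per_turn = tokens_read_per_turn(history)
print(per_turn)       # input size at each turn keeps growing
print(sum(per_turn))  # total tokens read over the whole conversation
```

The later a reply comes in the conversation, the more it has to read first, so latency tends to creep up as the history gets longer.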