Optimizing GPT-4 requests & best practices

I’m wondering what can be done to minimize the latency of GPT-4 API requests. Latency appears to depend on the size of the prompt, but are there any other ways to optimize response time?

Thanks