We fined tuned gpt-4.1 on image analysis and it seems the first request is slow because of cold start
- how long does cold start take on average?
- How long do the model stay warmed up? because we want to create an automated task to keep them warm so that our users don’t experience cold starts. We need to know how long do they stay warm? how often should we run the task? every minute? every hour?