API key from one account averages a 2-minute response time; a key from another account takes 12 to 16 seconds for the same request

It is as you discovered: the cause is not the application, but your account and the gpt-3.5-turbo models.
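If you want numbers to back that up, a rough sketch like the following times the same request under each account and compares the medians. Everything here is an assumption for illustration: `time_calls` and `compare_accounts` are hypothetical helper names, and each `send_request` callable stands in for whatever code you already use to send the identical request with a given account's key.

```python
import statistics
import time
from typing import Callable, Dict, List


def time_calls(send_request: Callable[[], object], runs: int = 5) -> List[float]:
    """Return the wall-clock duration in seconds of each call to send_request."""
    durations = []
    for _ in range(runs):
        start = time.perf_counter()
        send_request()  # hypothetical: your existing request code goes here
        durations.append(time.perf_counter() - start)
    return durations


def compare_accounts(
    senders: Dict[str, Callable[[], object]], runs: int = 5
) -> Dict[str, float]:
    """Map each account label to its median latency in seconds."""
    return {
        label: statistics.median(time_calls(fn, runs))
        for label, fn in senders.items()
    }
```

Pass one callable per account, each sending the exact same request, for example `compare_accounts({"account_a": call_with_key_a, "account_b": call_with_key_b})`, where both callables are your own. The median is used rather than the mean so that a single stalled request does not dominate the comparison.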

Maybe they bought new datacenter services for “special” users, built from gamer GPUs that can now run a vastly reduced gpt-3.5-turbo? Nobody knows.