GPT-Vision - item location, JSON response, performance

Request for features/improvements:

  • Performance improvements (a 50-200 token response for a 1 MB image takes 10-15 seconds with the detail parameter set to low)
  • Item location - API to include location data of items in the image
  • JSON response format
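
To illustrate the last two items, here is a rough sketch of the kind of response I have in mind and how a client could use it. Everything in it is an assumption for illustration only: the field names (`items`, `label`, `box`), the normalized 0..1 coordinates, and the helper `to_pixel_boxes` are made up and are not part of the current API.

```python
import json

# Hypothetical response body. None of these field names exist in the API today;
# they only show what "item location" in a JSON response could look like.
EXAMPLE_RESPONSE = """
{
  "items": [
    {"label": "mug", "box": {"x": 0.10, "y": 0.20, "width": 0.25, "height": 0.30}},
    {"label": "laptop", "box": {"x": 0.40, "y": 0.35, "width": 0.50, "height": 0.40}}
  ]
}
"""


def to_pixel_boxes(response_text, image_width, image_height):
    """Convert the assumed normalized boxes (0..1) into integer pixel corners."""
    data = json.loads(response_text)
    result = []
    for item in data["items"]:
        box = item["box"]
        left = round(box["x"] * image_width)
        top = round(box["y"] * image_height)
        right = round((box["x"] + box["width"]) * image_width)
        bottom = round((box["y"] + box["height"]) * image_height)
        result.append((item["label"], left, top, right, bottom))
    return result


if __name__ == "__main__":
    # Print one (label, left, top, right, bottom) tuple per detected item.
    for entry in to_pixel_boxes(EXAMPLE_RESPONSE, 1000, 800):
        print(entry)
```

Normalized coordinates are just one option; they would keep the boxes valid even if the image is resized before processing.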