GPT-4-Vision Interesting Uses and Examples Thread (2023)

Can it help me with my RTS game? Not likely … yet.

Didn’t OpenAI play StarCraft for a while?

I used it to search surveillance camera footage:

  • Upload the video to VideoDB.
  • Use GPT-4 Vision to analyse the footage.
  • Search with queries like “dogs on street”.
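
For anyone curious how the steps above might fit together, here is a rough sketch of the general idea: describe sampled frames with a vision model, then search the descriptions as text. It is not VideoDB's API or the poster's actual code. `FrameNote`, `index_frames`, `search`, and the `describe` callback (which would wrap whatever vision model call you use) are all hypothetical names, and the word-overlap ranking is just a simple stand-in for a real search index.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, List, Tuple


@dataclass
class FrameNote:
    """A text description of one sampled frame (hypothetical structure)."""
    timestamp_s: float
    description: str


def index_frames(
    frames: Iterable[Tuple[float, bytes]],
    describe: Callable[[bytes], str],
) -> List[FrameNote]:
    """Describe each (timestamp, image bytes) pair.

    `describe` is an assumption: it stands in for a call to a vision
    model that returns a short caption for the image.
    """
    return [FrameNote(ts, describe(image)) for ts, image in frames]


def search(notes: List[FrameNote], query: str) -> List[FrameNote]:
    """Rank notes by how many query words appear in the description."""
    terms = set(query.lower().split())
    scored = []
    for note in notes:
        overlap = len(terms & set(note.description.lower().split()))
        if overlap:
            scored.append((overlap, note))
    # Best match first; ties are broken by earliest timestamp.
    scored.sort(key=lambda pair: (-pair[0], pair[1].timestamp_s))
    return [note for _, note in scored]
```

In practice you would sample a frame every few seconds rather than every frame, since each description costs a model call.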

I tried an experiment in reading D&D dice rolls (using ChatGPT actually, but I suppose the model is the same).

It doesn’t work well: zero-shot, it has trouble reading the correct number. Asking it to try again sometimes improves the detection.

In this case:

  • it detects only one d20 die (the dark grey one)
  • the numbers are wrong

The OCR capabilities of the vision model are not great with text that is not horizontally aligned. Hopefully that will be addressed in updates.
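
Since asking again sometimes helps, one way to automate that is to ask several times and keep the answer only when most attempts agree. This is a minimal sketch under that assumption, not something tested against the model: `read_die` is a hypothetical helper, and `ask` stands in for whatever function sends the photo to the vision model and returns its reply as text. It will not help if the model misreads the die the same way every time.

```python
from collections import Counter
from typing import Callable, Optional


def read_die(ask: Callable[[], str], attempts: int = 5, sides: int = 20) -> Optional[int]:
    """Query the model several times and return the majority reading.

    `ask` is an assumption: a zero-argument callable that asks the
    vision model for the number showing on the die and returns the reply.
    Returns None when no value wins a strict majority of the attempts.
    """
    votes: Counter = Counter()
    for _ in range(attempts):
        reply = ask().strip()
        # Ignore replies that are not a valid face value for this die.
        if reply.isdecimal() and 1 <= int(reply) <= sides:
            votes[int(reply)] += 1
    if not votes:
        return None
    value, count = votes.most_common(1)[0]
    return value if count > attempts / 2 else None
```

Returning None rather than the most common guess makes it obvious when the readings disagree and the roll should be checked by hand.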

Here’s my app, which features an AI-powered NPC that uses GPT-4 Vision to analyze its surroundings:

(Edited to show using GPT 4o for image processing)
