Hello, I was wondering if it would be possible for Chat GPT to have the capability to mark up images. Although I appreciate GPT-4 Vision, there have been instances where I provided Chat with a diagram or a drawing of a complex chemical structure or a UML diagram and asked a question about it. Chat …

Providing Chat GPT the Ability to Mark-Up Images

_j February 7, 2024, 10:45pm 2

It would be “easy”, but not easy for gpt-4-vision. Grounding, bounding boxes, entity identification, etc is not part of the AI.

Azure, for example, can layer different vision models to perform such a task.

1 Like

Topic		Replies	Views
Why is native image markup still a hurdle for GPT models? And is Open AI working on such capabilities? Community chatgpt	0	82	February 18, 2025
What to expect after chatGPT and DALL-E API	1	613	December 29, 2022
ChatGPT goes Multimodal! Sound and vision is rolling out on ChatGPT Community chatgpt , multimodal	34	13971	December 10, 2023
GPT-4 Photo Analysis Capabilities API gpt-4	1	4336	December 18, 2023
Feature Request: First-class Object References for Function Calls Feedback api , feature-request , community-feedback	3	808	December 2, 2023