Prompt for Image to JSON conversion

cg27031998 · July 15, 2024, 11:59am

I have process diagram present in a image. I am converting this image to base64 encoding and attaching it to my prompt. I have given instructions to prompt to understand the image and give output in the JSON structure which I have provided in my prompt. It works but not always. I am looking to improve its accuracy.
What are the specific things to look when I am writing prompt for such conversions?

Diet · July 15, 2024, 12:11pm

Welcome to the community!

This is generally a tricky undertaking because of the way the models hallucinate.

I would suggest adding a “thinking” field to the top of your json schema, where you tell the model to explicitly reason about the contents of the image before translating it into your specific json structure.

The issue is that as soon as the model misrepresents something, it’s very difficult to iron that kink back out again. That makes using LLM vision in production not easy.

What specific failure modes are you encountering? It’s probably best to work the issues out one by one until you get to an acceptable level of reliability.

Topic		Replies	Views
Need help in prompting of recognizing annoted elements in the image Prompting gpt-4 , api	3	483	May 24, 2024
Prompting for OCR (question) Prompting chatgpt , gpt-4o	1	1707	May 22, 2024
Data points in tables and charts in images Prompting gpt-4	6	1304	June 25, 2024
Valid json every time? Prompting	17	11585	January 3, 2024
Prompt integrating JSON, or JSON request after the prompt API chatgpt , api , json	4	17410	May 29, 2023

Prompt for Image to JSON conversion

Related topics