GPT-4-vision extraction of tables with branched rows/vertically-merged cells

The issue, as you’ve found, is that you are confined to prompting alone… You could try to adjust the image manually, or even find some consistent structures, cut the tables out automatically, and query them individually, but that is exactly what these table-OCR models already do.
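If you do want to try the cut-the-tables-out route, here is a rough sketch of the idea. It assumes the page is already loaded as a 2D list of grayscale pixel values, that the background is pure white (255), and that tables are separated by horizontal bands of blank rows. The function names `find_content_bands` and `crop_rows` are just illustrative, not from any library, and real scans would need thresholding and noise handling first.

```python
from typing import List, Tuple

Image = List[List[int]]  # grayscale rows; assumption: 255 means background


def find_content_bands(
    pixels: Image, background: int = 255, min_gap: int = 2
) -> List[Tuple[int, int]]:
    """Return (top, bottom) row ranges, bottom exclusive, of non-blank content.

    A new band starts wherever at least `min_gap` consecutive
    background-only rows separate two rows that contain ink.
    """
    bands: List[Tuple[int, int]] = []
    start = None     # first row of the band currently being built
    last_ink = None  # most recent row that contained a non-background pixel
    for y, row in enumerate(pixels):
        if any(p != background for p in row):
            if start is None:
                start = y
            elif y - last_ink > min_gap:
                # Enough blank rows since the last ink: close the band.
                bands.append((start, last_ink + 1))
                start = y
            last_ink = y
    if start is not None:
        bands.append((start, last_ink + 1))
    return bands


def crop_rows(pixels: Image, band: Tuple[int, int]) -> Image:
    """Copy out the rows of one band so it can be queried on its own."""
    top, bottom = band
    return [row[:] for row in pixels[top:bottom]]


if __name__ == "__main__":
    W = 255
    page = [
        [W, W, W],
        [0, W, 0],  # first block of content
        [0, 0, 0],
        [W, W, W],
        [W, W, W],
        [W, W, W],
        [W, 0, W],  # second block of content
        [W, W, W],
    ]
    for band in find_content_bands(page):
        print(band, crop_rows(page, band))
```

Each cropped band would then be saved as its own image and sent as a separate query. Tuning `min_gap` is the fiddly part: too small and a single table gets split between rows, too large and neighbouring tables get merged.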

In your tests, what has made you lean towards prompting GPT-V, frustrating as that is?