Spatial awareness of small tabled data. Best method?

stevenic · October 5, 2024, 7:45am

This works with gpt-4o:

Row 0:
[0,0] Alice
[0,1] Jim
[0,2] Stuart
[0,3] William
[0,4] Angela
[0,5] June
[0,6] Wendy
[0,7] Tim

Row 1:
[1,0] Rick
[1,1] Laura
[1,2] George
[1,3] Rowan
[1,4] Isla
[1,5] Helen
[1,6] Henry
[1,7] Calum

Row 2:
[2,0] Fred
[2,1] Arthur
[2,2] Pamela
[2,3] Ben
[2,4] Kate
[2,5] Amy
[2,6] Philip
[2,7] Paul

Row 3:
[3,0] Mary
[3,1] Pat
[3,2] Kelly
[3,3] Alan
[3,4] Lily
[3,5] Dan
[3,6] Steve
[3,7] Mike

Row 4:
[4,0] Mat
[4,1] Cameron
[4,2] Duncan
[4,3] James
[4,4] Oliver
[4,5] John
[4,6] Aulay
[4,7] Connor

The model sees this list as:

Row 0: [0,0] Alice [0,1] Jim [0,2] Stuart [0,3] William [0,4] Angela [0,5] June [0,6] Wendy [0,7] Tim Row 1: [1,0] Rick [1,1] Laura [1,2] George [1,3] Rowan [1,4] Isla [1,5] Helen [1,6] Henry [1,7] Calum etc…

The model may see in 1D, but it actually does a decent job of mapping that information spatially. You can help it out by giving it anchors… The “Row n:” labels give the model anchor points so it knows where clusters of tokens start, and clusters is probably the best way to think about it. The token “Row” puts the model in the right frame of mind to think spatially: it knows that rows can sit above and below each other. The cell coordinates give each name an anchor that can be reasoned over (or at least fake reasoned over, because they can’t truly reason). Through training the model has learned that 2 comes before 3, 4 comes after 3, and so on.
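If you want to generate this layout from data rather than typing it by hand, here is a minimal sketch. The function name `format_grid` and the tiny sample grid are my own placeholders, not anything from a library; it only assumes your table is a list of rows.

```python
from typing import List


def format_grid(grid: List[List[str]]) -> str:
    """Render a 2D list as 'Row n:' blocks, one '[row,col] value' line per cell."""
    blocks = []
    for r, row in enumerate(grid):
        lines = [f"Row {r}:"]  # row label acts as the anchor for the cluster
        lines += [f"[{r},{c}] {value.strip()}" for c, value in enumerate(row)]
        blocks.append("\n".join(lines))
    # A blank line between rows keeps the clusters visually separate.
    return "\n\n".join(blocks)


# Hypothetical sample data; swap in your own table.
grid = [
    ["Alice", "Jim", "Stuart"],
    ["Rick", "Laura", "George"],
]
prompt_table = format_grid(grid)
print(prompt_table)
```

Paste the resulting text into the prompt ahead of your question so every name sits right next to its coordinate.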

You can’t just ask who’s diagonally below Lily, because that’s not specific enough. You have to ask who’s below and to the right. Everything is really a function of how far a value is from its label/anchor. The closer a value is to an anchor that has semantic meaning, the more likely you are to get an accurate answer. When you have just a bunch of names separated by pipes (|), there’s nothing for the model to latch onto.
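To check the model's answers to questions like that, you can compute the expected neighbour yourself. This is just a sketch for verification on your side: `neighbor` and the direction names in `OFFSETS` are names I made up, and the grid is the one from the example above.

```python
from typing import List, Optional

GRID: List[List[str]] = [
    ["Alice", "Jim", "Stuart", "William", "Angela", "June", "Wendy", "Tim"],
    ["Rick", "Laura", "George", "Rowan", "Isla", "Helen", "Henry", "Calum"],
    ["Fred", "Arthur", "Pamela", "Ben", "Kate", "Amy", "Philip", "Paul"],
    ["Mary", "Pat", "Kelly", "Alan", "Lily", "Dan", "Steve", "Mike"],
    ["Mat", "Cameron", "Duncan", "James", "Oliver", "John", "Aulay", "Connor"],
]

# (row delta, column delta) for each explicit direction.
OFFSETS = {
    "above": (-1, 0), "below": (1, 0), "left": (0, -1), "right": (0, 1),
    "above_left": (-1, -1), "above_right": (-1, 1),
    "below_left": (1, -1), "below_right": (1, 1),
}


def neighbor(grid: List[List[str]], name: str, direction: str) -> Optional[str]:
    """Return the name next to `name` in `direction`, or None if off the grid."""
    dr, dc = OFFSETS[direction]
    for r, row in enumerate(grid):
        if name in row:
            nr, nc = r + dr, row.index(name) + dc
            if 0 <= nr < len(grid) and 0 <= nc < len(grid[nr]):
                return grid[nr][nc]
            return None
    raise ValueError(f"{name!r} is not in the grid")


print(neighbor(GRID, "Lily", "below_right"))  # John
print(neighbor(GRID, "Lily", "below_left"))   # James
```

Notice that “diagonally below” maps to two different cells here, which is exactly why the question has to name the direction.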

Hope that helps…
