How to format context documents to allow model to recognize specific fields within documents

Well, what do you know? I posed the question to GPT-4.

Formatting the input in a structured manner can help the model better understand the data it’s processing. You can try adding extra line breaks and colon characters to clearly separate the fields. Here’s an example:

Question: What documents are available with the title “Drupal AI SolrAI - CSS”? Number of context documents: 2

-----
Context document 1:
Document Title: Drupal AI SolrAI - CSS
Content: blah, blah…
Event Date: 
Document Groups: Documentation, Development, D9 Development, AI
Document Taxonomy/Tags: 
Source Document Summary: blah, blah…
URL: https://master1and1-9.schoolboard.net/node/1445
-----

Context document 2:
Document Title: AI: Drupal SolrAI Module
Content: blah, blah…
Event Date: 
Document Groups: Documentation, Development, D9 Development, AI
Document Taxonomy/Tags: AI ToDo
Source Document Summary: blah, blah…
URL: https://master1and1-9.schoolboard.net/node/1402
-----

Each field in the context document is now on its own line, which could make it easier for the model to identify them. The use of “-----” to separate the documents remains the same.

I made that change, and now this is what I get:

It’s not perfect, but WAY better than what I was getting before. Thank you GPT-4! Again!

4 Likes