Scenario:
I’m working on extracting content from semi-structured documents that include tables, text, and other elements. The extraction follows a consistent pattern for a table: retrieving no_rows × no_columns items from the document.
Issue:
-
GPT-4.1 Output:
GPT-4.1 often stops after partially extracting data (e.g., first 100+ entries) and then provides reasons like brevity or space limitations. -
GPT-4.1-mini Output:
While GPT-4.1-mini introduces some hallucinations, it successfully returns the desired number of items in full.
What I’ve Tried:
As suggested in the GPT-4.1 Prompting Guide Caveats, I strongly instructed the model to output all entries in full using prompts like:
Extract **ALL** (hundreds of entries) from the above data in **FULL**, following the instructions and rules provided in the system message.
Despite this, GPT-4.1 still truncates the output and cites space/brevity concerns.
Question for the Community:
-
Is this behavior expected for GPT-4.1 due to token or safety constraints?
-
Are there any proven strategies or prompt engineering techniques to force GPT-4.1 to return complete outputs without truncation?

