ChatGPT's Response to Context in Coding Prompts: Snake tests

TL;DR: I tested ChatGPT’s response to the same coding request (“Ok write a python script that recreates the game snake”) embedded in different contexts. The resulting scripts varied notably in complexity, structure, and playability, highlighting how prompt context shapes ChatGPT-generated code.


Experiment Setup

I gave ChatGPT the same final request but embedded it within different contextual scenarios. These ranged from straightforward prompts to ones with specific constraints or altered scenarios (a tight deadline, an instruction to provide only a code block with no thought process, etc.).

Findings

Each context led to a unique version of the Snake game, varying in code complexity, structure, and user experience. Some versions were basic, while others incorporated advanced features and refined gameplay mechanics. The best results came from providing Custom Instructions specifying that answers should consist only of code blocks starting and ending with ```.
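For illustration, here is a minimal sketch of how an equivalent instruction could be passed as a system message through the API (assuming the `openai` Python package, v1 or later; the exact wording of the original Custom Instructions is mine, not the author’s):

```python
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

# Hypothetical wording; the original Custom Instructions are not quoted in the post.
system_instruction = (
    "Answer only with a single Python code block, starting with ``` and ending with ```. "
    "Do not include any explanation before or after the code."
)

response = client.chat.completions.create(
    model="gpt-3.5-turbo",
    messages=[
        {"role": "system", "content": system_instruction},
        {"role": "user", "content": "Ok write a python script that recreates the game snake"},
    ],
)
print(response.choices[0].message.content)
```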

Repository and Evaluation

I’ve documented the entire process in a GitHub repo, which includes:


Insights and Discussion

Test the different scripts yourself. I’ll run more tests over the next few days with different coding tasks while keeping the same prompts. If anyone is interested, I’ll share my findings.


The better way to prompt for coding:

  • Don’t restrict the AI’s output; instead, expand on the AI’s language production by giving it planning ability;
  • Define the programming environment for clarity;
  • Write better game instructions, or let the AI rewrite the request into a better game specification before starting (see the sketch below).
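A hypothetical prompt template along those lines; the environment (Python 3 + pygame) and the phrasing are illustrative assumptions, not the exact prompt used in the linked conversation:

```python
# Illustrative only: environment and wording are assumptions, not the original prompt.
PROMPT = """You are an expert Python developer working in Python 3 with pygame.

Before writing any code:
1. Rewrite the user's game specification in your own words, filling in any missing details.
2. List the modules, classes, and functions you expect to need.
3. Outline the program structure step by step.

Then write the complete, runnable script in a single code block.

User request: write a Python script that recreates the game Snake."""
```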

I upgraded the model (and the fun) to ChatGPT 3.5, where 10+ iterations cost the same as one gpt-4-turbo response (or are simply free in ChatGPT).

https://chat.openai.com/share/b1491265-5144-49c1-ab96-65ba56b93994


Four iterations and it still looks like you’re the one assisting GPT :sweat_smile:. It’s almost as if it confuses itself with all the thought process and unnecessary descriptions of what it’s about to do.

Hey, it’s gpt-3.5-turbo: a model one would have dismissed for coding earlier, had gpt-4 not been taken down a peg.

The pre-discussion is chain-of-thought, giving the AI planning ability. Text generation is one-directional: the model can’t go back and add an `import` statement when, partway through writing code tokens, it discovers it needs a library or an extra function.
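One way to work with that constraint is to split the request into an explicit planning turn followed by a code turn, so imports and helpers are decided before any code tokens are generated. A minimal sketch, again assuming the `openai` Python package; the prompts are illustrative, not the ones from this thread:

```python
from openai import OpenAI

client = OpenAI()

# Turn 1: ask only for the plan (spec restatement, imports, function list).
messages = [
    {"role": "user", "content": (
        "We will write a Snake game in Python. Do not write code yet. "
        "Restate the specification, list the imports you will need, "
        "and outline the functions you will write."
    )},
]
plan = client.chat.completions.create(model="gpt-3.5-turbo", messages=messages)
messages.append({"role": "assistant", "content": plan.choices[0].message.content})

# Turn 2: with the plan already in context, ask for the full script.
messages.append({"role": "user", "content": "Now write the complete script in one code block."})
code = client.chat.completions.create(model="gpt-3.5-turbo", messages=messages)
print(code.choices[0].message.content)
```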


Did you try out what the AI actually wrote? Your “medium” version is silly Logo-style turtle code with “pen up” etc. that does nothing. You’re going to need a lot of manual assistance on that junk.


I added “rewrite the user’s specification” as a task the AI must also do…and handed the prompt I wrote over to gpt-4-turbo (gpt-4-1106-preview). Not GPT-4.

Guess what: as a hint of its origins and qualities, it also gives a traceback on the first run, and then the code has the same faults as gpt-3.5-turbo’s: gameplay is too fast, the window dimensions similarly needed to be squared, and the snake wraps around instead of crashing into the edge. Then, to add insult to injury, it goes into code-omission mode, filling its output with ellipsis elision markers. And it again draws the food inside the border wall I specified. Not worth wasting more tokens on what is clearly the same path as gpt-3.5.
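For reference, a hypothetical sketch of the kind of fixes those faults call for; the grid size, border convention, and function names are my own, not taken from the generated scripts (and the “too fast” complaint is usually just a missing or too-small frame delay in the main loop):

```python
import random

GRID_W, GRID_H = 20, 20  # square playfield; row/column 0 and GRID-1 form the border wall

def step(head, direction, snake_body):
    """Advance the head one cell; end the game at the border instead of wrapping."""
    new_head = (head[0] + direction[0], head[1] + direction[1])
    # Crash into the wall (no wrap-around) or into the snake itself.
    if not (1 <= new_head[0] <= GRID_W - 2 and 1 <= new_head[1] <= GRID_H - 2):
        return None  # game over
    if new_head in snake_body:
        return None
    return new_head

def spawn_food(snake_body):
    """Place food strictly inside the border wall and never on the snake."""
    while True:
        cell = (random.randint(1, GRID_W - 2), random.randint(1, GRID_H - 2))
        if cell not in snake_body:
            return cell
```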

You can try out a prompt of nothing but the date on models lesser than GPT-4 and see how that works out for you…
