Prompts as psuedo-code - where are the limits?

Surprisingly, I CAN omit display of everything except ‘Evaluate the result’. GPT can’t do math. Unless it explicitly displays the evaluation result, it guesses incorrectly that an expression evaluating to 11 in fact satisfies the problem.