I’m quite dissatisfied with the performance of the GPT-4 engine. As the enclosed image shows, the engine got a simple-to-moderate JS problem wrong 11 times, and in 3 of those attempts the output it predicted for its own code was also wrong.
It reached a working solution on the 12th try, and even that solution is too long and needs to be refactored.
PS. Refactoring was also a mess and reintroduced all the earlier errors. It tried 10 times to refactor the code. I have the conversation saved but did not post it here. After the 10th try it still had not succeeded, although it claimed it had and gave an exact expected output (which was wrong). By the 10th try the code was more or less the same length as the original solution, so I gave up.