It is indeed documented that material in the middle of the context is understood less well, and the more the AI writes, the more what used to be the end of the context becomes the middle.
So if you really want a technique that is not biased by position, you can rotate the input: move the top section (and likewise the top item inside each section) down to the bottom, one at a time, and make an API run for each ordering. After 6 or 12 runs (or however many it takes to get back to the initial order), take each run's best 10 and choose the final best 10 by how many of those lists each item appears in.
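The rotate-and-count idea can be sketched roughly like this. It is only an illustration: `pick_top` is a hypothetical stand-in for whatever function wraps your API call and returns the model's picks for one ordering, and the sketch assumes the items are plain strings and that one full cycle of rotations is enough.

```python
from collections import Counter
from typing import Callable, Iterator, List, Sequence

# Hypothetical signature: takes one ordering and k, returns the k picks.
PickTop = Callable[[List[str], int], Sequence[str]]


def rotations(items: Sequence[str]) -> Iterator[List[str]]:
    """Yield each cyclic rotation; every step moves the top item to the bottom."""
    items = list(items)
    for shift in range(len(items)):
        yield items[shift:] + items[:shift]


def top_k_by_appearance(items: Sequence[str], pick_top: PickTop, k: int = 10) -> List[str]:
    """Run pick_top once per rotation and keep the k items picked most often."""
    votes: Counter = Counter()
    for ordering in rotations(items):
        # set() so an item is counted at most once per run
        votes.update(set(pick_top(ordering, k)))
    # sorted() is stable, so ties keep the original input order
    ranked = sorted(items, key=lambda item: -votes[item])
    return ranked[:k]
```

With n items this makes n calls to `pick_top`, so every item spends exactly one run in each position, including the weak middle ones.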
Another approach would be to score them individually, giving the AI a few reference scenes with their scores as examples to calibrate against.
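As a rough sketch of the individual-scoring approach, a prompt with reference scenes might be assembled like this. The function name `build_scoring_prompt`, the 1 to 10 scale, and the exact wording are all assumptions made for illustration, not anything a particular API requires.

```python
from typing import Sequence, Tuple


def build_scoring_prompt(
    candidate: str,
    anchors: Sequence[Tuple[str, int]],
    low: int = 1,
    high: int = 10,
) -> str:
    """Build a prompt that shows scored reference scenes before the one to score."""
    lines = [
        f"Score the scene from {low} to {high}. Reply with the number only.",
        "",
        "Reference scenes:",
    ]
    # Each anchor is a (scene text, score) pair the model can calibrate against.
    for text, score in anchors:
        lines += [f"Scene: {text}", f"Score: {score}", ""]
    # The candidate goes last, with the score left open for the model to fill in.
    lines += ["Now score this one:", f"Scene: {candidate}", "Score:"]
    return "\n".join(lines)
```

Because each candidate is scored in its own call against the same anchors, its position relative to the other candidates no longer matters.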
More: if you really want the AI to be a judge, and you aren't simply judging the AI, give it the original section along with the modifications. Without that, it can't tell whether the person is recommending exactly what's already written.