So getting ChatGPT to output long string dumps similar to how a data image is structured- is one of the methods folks are using to get OpenAI to show some of its training data. They have been locking down things behind the scenes, but have been a little vague on limits while they are tweaking things. I have not been able to find any officially published data with express max limits, but people’s speculations.