if every week several different prompts that have different types of results and/or focus were given to the models and then recorded, then there would be a factual record of the changes.
I tend to think the general idea that the models change with time is valid but there is a lack of recorded evidence over time.