How many words in Cyrillic can you get from a million tokens?

Have a look at this post that discusses some of the reasons for inconsistencies: