Rules of Thumb for number of source code characters to tokens

Thanks. I also found this older post that was interesting.

I finally picked a simple character count for the samples, and left the choice of tokenizer to the library client code.

1 Like