Codex Tokenizer Logic

simonl · December 29, 2021, 3:32am

I decided to bite the bullet and built one for NodeJS: xanthous-tech/gpt3-tokenizer on GitHub ("Isomorphic Tokenizer for GPT3 algorithm for OpenAI").

I have incorporated the best of what is out there and followed the minified code from the OpenAI tokenizer demo page. It currently supports NodeJS, but I should be able to make it available in the browser quickly (the same as the token counting in the Playground). It should also count tokens correctly for Codex models, since it merges consecutive spaces into a single token.
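
To illustrate why merging consecutive spaces matters for indented code, here is a toy sketch. It is not the library's code and not real BPE: countPieces is a hypothetical helper that only counts space-delimited pieces, on the assumption that a run of spaces becomes a single piece when merging is on.

```javascript
// Toy illustration only: counts pieces of text, not real BPE tokens.
// countPieces is a hypothetical helper, not part of gpt3-tokenizer.
function countPieces(text, mergeSpaceRuns) {
  // With merging, a whole run of spaces is one piece;
  // without it, every single space is its own piece.
  const pattern = mergeSpaceRuns ? / +|[^ ]+/g : / |[^ ]+/g;
  const pieces = text.match(pattern);
  return pieces === null ? 0 : pieces.length;
}

const line = "    return x"; // four spaces of indentation

console.log(countPieces(line, false)); // 7: four spaces, "return", one space, "x"
console.log(countPieces(line, true));  // 4: one space run, "return", one space, "x"
```

The real counts will differ because the actual tokenizer applies BPE merges on top of this, but the gap between the two numbers shows roughly why indentation-heavy code would be overcounted without the merging.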
