Loading...

TokenMonster Tester

Instructions  |  GitHub  |  Benchmark
TokenMonster Fiction (4096) Vs. GPT2 Tokenizer (50256)
Fiction 1024 Vs. 100256
EnglishCode 24000 Vs. 100256
GPT2 Tokenizer Vs. EnglishCode 50256
LLaMa Tokenizer Vs. Fiction 32000
Just how well can it tokenize code?