Question 1

How accurate is the token estimate?

Accepted Answer

It is a heuristic, not an exact count. Real tokenizers split text using a learned vocabulary, which requires the model's tokenizer files. This tool instead uses the well-known rule of about 4 characters per token for English, with fewer characters per token for code and non-Latin scripts. For most prompts it lands within about 10 to 20 percent of the true count, which is close enough to size a prompt or estimate cost.
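A minimal sketch of a characters-per-token estimate may help make the rule concrete. Only the 4 characters per token figure for English comes from the answer above; the ratios for code and non-Latin text, the dictionary name, and the function name are illustrative assumptions, not the tool's actual constants.

```python
import math

# Assumed characters-per-token ratios. Only the English value (about 4)
# is stated in the answer; the other two are illustrative guesses.
CHARS_PER_TOKEN = {
    "english": 4.0,
    "code": 3.0,       # assumption: code packs fewer characters per token
    "non_latin": 1.5,  # assumption: non-Latin scripts pack fewer still
}


def estimate_tokens(text: str, kind: str = "english") -> int:
    """Estimate a token count by dividing the character count by a ratio."""
    if not text:
        return 0
    # Round up so that any non-empty text counts as at least one token.
    return math.ceil(len(text) / CHARS_PER_TOKEN[kind])


print(estimate_tokens("x" * 400))  # 100
```

With a stated error of 10 to 20 percent, an estimate of 100 tokens would correspond to a true count somewhere around 80 to 120.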

Question 2

Why do GPT, Claude, Llama, and Gemini show different numbers?

Accepted Answer

Each family uses a different tokenizer, so the same text splits into a slightly different number of tokens. The tool applies a small per-family adjustment to the baseline estimate to reflect that. The adjusted figures are still estimates, so treat the differences between families as approximate.
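As a rough illustration of a per-family adjustment, the sketch below scales a baseline estimate by a multiplier for each family. The multipliers, the dictionary name, and the function name are all hypothetical; the answer does not say which values the tool uses.

```python
# Hypothetical multipliers, chosen only to show the mechanism.
# These are NOT measured tokenizer ratios or the tool's real values.
FAMILY_FACTOR = {
    "gpt": 1.00,
    "claude": 1.05,
    "llama": 1.10,
    "gemini": 0.95,
}


def adjust_for_family(baseline_tokens: int, family: str) -> int:
    """Scale a baseline token estimate by the family's assumed factor."""
    return round(baseline_tokens * FAMILY_FACTOR[family])


# The same baseline yields a slightly different figure per family.
for name in FAMILY_FACTOR:
    print(name, adjust_for_family(1000, name))
```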

Question 3

How is the cost calculated?

Accepted Answer

It divides the estimated token count by one million and multiplies the result by a representative input price per million tokens for a common model in each family. The table also lists the output price, since replies are billed separately. Pricing changes over time and varies by exact model and provider, so check the provider for current rates.
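The arithmetic can be sketched as follows, using the input prices shown in the table on this page. The dictionary and function names are assumptions made for the example, and the prices are only as current as that table.

```python
# Input prices in USD per one million tokens, copied from the table
# on this page. They may be out of date; check the provider.
INPUT_PRICE_PER_MILLION = {
    "gpt": 2.50,     # based on GPT-4o
    "claude": 3.00,  # based on Claude Sonnet
    "llama": 0.60,   # based on Llama 3.x, typical host
    "gemini": 0.10,  # based on Gemini Flash
}


def input_cost_usd(tokens: int, family: str) -> float:
    """Estimated input cost: tokens / 1,000,000 * price per million."""
    return tokens / 1_000_000 * INPUT_PRICE_PER_MILLION[family]


# A 2,000-token prompt costs a fraction of a cent on any of these.
for name in INPUT_PRICE_PER_MILLION:
    print(f"{name}: ${input_cost_usd(2_000, name):.4f}")
```

Output tokens would be costed the same way with the output price column, which this sketch leaves out.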

Question 4

What counts as a token?

Accepted Answer

A token is a chunk of text a model reads, often a short word or part of a word. A rough guide is about 0.75 words per token in English, so 100 tokens is roughly 75 words. Punctuation marks and spaces can each take their own token, and rare words are often split into several.
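The words-per-token guide converts in both directions. The sketch below assumes the 0.75 figure from the answer; the function names are made up for the example, and the results are rough guides for English text only.

```python
WORDS_PER_TOKEN = 0.75  # rough English average stated in the answer


def tokens_to_words(token_count: int) -> int:
    """Approximate how many English words fit in a token budget."""
    return round(token_count * WORDS_PER_TOKEN)


def words_to_tokens(word_count: int) -> int:
    """Approximate how many tokens a given number of words needs."""
    return round(word_count / WORDS_PER_TOKEN)


print(tokens_to_words(100))  # 75
print(words_to_tokens(75))   # 100
```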

Question 5

Does my text get uploaded?

Accepted Answer

No. The text is analyzed locally in your browser. Nothing is uploaded or stored, so it is safe to paste private prompts.

Model family	Price based on	Input $ / 1M tokens	Output $ / 1M tokens	Est. input cost
GPT-3.5 / GPT-4 (OpenAI)	GPT-4o	$2.50	$10.00	$0
Claude (Anthropic)	Claude Sonnet	$3.00	$15.00	$0
Llama (Meta)	Llama 3.x, typical host	$0.60	$0.60	$0
Gemini (Google)	Gemini Flash	$0.10	$0.40	$0

AI Token Counter
