A fast, accurate tokenizer built for OpenAI models. Instantly estimate token usage, validate prompt size, and avoid unexpected API costs — all with model-aware encoding.
Tokenizer is a native macOS utility that uses OpenAI’s tokenization logic (via the Tiktoken library) to reliably count tokens as they would be processed by OpenAI language models.
- Instant token-count for any text input.
- Model-aware encoding selection: e.g., `o200k_base` for GPT-4o, `cl100k_base` for GPT-4/GPT-3.5, `p50k_base` for older Codex models.
- Real-time updates as you type or paste text.
- Clean, native macOS UI built with Swift & SwiftUI.
- Ideal for API developers, prompt engineers, and anyone working with language models.
Contributions are welcome! If you spot a bug, have a feature request, or want to improve token-encoding support:
- Fork the repository.
- Create a feature branch.
- Write tests (if applicable).
- Submit a pull request with a clear description of your changes.
The original Tiktoken Swift implementation was built by asepinilla and later modified for this project.
This project is licensed under the MIT License. See LICENSE for details.