TQ1.0 ternary inference engine for BitNet b1.58 on CPU. Pack + run Falcon3-1B/3B/7B/10B, no GPU needed.
-
Updated
May 31, 2026 - Python
TQ1.0 ternary inference engine for BitNet b1.58 on CPU. Pack + run Falcon3-1B/3B/7B/10B, no GPU needed.
Windows-native BitNet and ternary LLM inference with CPU GGUF, GPU runtime, terminal and browser chat, and release zips.
Run BitNet 1.58-bit and ternary LLMs on Windows with CPU and GPU inference, chat tools, and release-ready builds
Add a description, image, and links to the falcon3 topic page so that developers can more easily learn about it.
To associate your repository with the falcon3 topic, visit your repo's landing page and select "manage topics."