HiveTerminal

The First Terminal-Based Agentic Coding Assistant That Solves the Token Explosion Problem

A dual-mode CLI tool with shared memory and revolutionary token optimization, built on Mistral Vibe.

✨ Now with full Windows support! Works on macOS, Linux, and Windows 10/11.

📑 Table of Contents

🏆 Why HiveTerminal Stands Out

The Problem We Solved

Every stateless AI API has a critical flaw: They require sending the entire conversation history with every request. For agentic coding sessions, this creates an exponential token explosion:

Turn 1: 1,000 tokens
Turn 10: 50,000 tokens
Turn 50: 1,000,000 tokens
Turn 100: 2,000,000+ tokens

Real-world impact: A simple todo list task burned nearly 1M tokens in testing. At GPT-4 pricing, a 100-turn coding session costs $60 instead of $0.60.

Our Solution: 98% Token Reduction

HiveTerminal implements a two-phase token optimization system that reduces token usage by 98%:

Phase 1: Sliding Window (80-95% reduction)

Keeps only the last 5 conversation turns in history
System prompt always preserved
Configurable window size (3-20 turns)
Prevents exponential growth

Phase 2: Local State Management (Additional 50-80% reduction)

Stores working data locally instead of in chat history
Automatic state injection into context
Persistent across sessions
Zero configuration required

Combined Result:

Before: 1M tokens for 100-turn session ($30)
After: 20K tokens for same session ($0.60)
Savings: 98% reduction ($29.40 saved)

What Makes HiveTerminal Unique

Feature	HiveTerminal	Cursor	Aider	Windsurf	Cline
Token Optimization	✅ 98% reduction	❌ No optimization	❌ No optimization	❌ No optimization	❌ No optimization
Local State Management	✅ Automatic	❌ None	❌ None	❌ None	❌ None
Dual-Mode Operation	✅ Conversational + Spec	❌ Single mode	❌ Single mode	❌ Single mode	❌ Single mode
Shared Memory (Hive Mind)	✅ Git-shareable ChromaDB	❌ None	⚠️ Limited	❌ None	❌ None
100% Local (Ollama)	✅ Full offline support	❌ Cloud only	✅ Supported	❌ Cloud only	⚠️ Limited
Multi-Provider	✅ 8+ providers	⚠️ Limited	✅ Multiple	⚠️ Limited	⚠️ Limited
Terminal-Native	✅ Rich TUI	❌ VS Code extension	✅ CLI	❌ VS Code extension	❌ VS Code extension
Cost for Long Sessions	💰 $0.60 (100 turns) or Free for localy using LLM	💰 $60 (100 turns)	💰 $60 (100 turns)	💰 $60 (100 turns)	💰 $60 (100 turns)

Key Innovations

Token Optimization: Only terminal-based agentic tool that solves the token explosion problem
Dual-Mode Operation: Choose between conversational and spec-first workflows
Hive Mind Memory: Git-shareable vector database for team knowledge
Smart State Management: AI automatically stores and retrieves working data
True Local-First: 100% offline with Ollama, no cloud dependencies
Cost Effective: 98% cheaper for long coding sessions

What is HiveTerminal?

HiveTerminal is a terminal-based AI coding assistant that combines the flexibility of conversational workflows with the transparency of spec-first development. Built on top of Mistral Vibe, it adds three revolutionary innovations:

Token Optimization: 98% reduction in token usage through sliding window + local state
Dual-Mode Operation: Choose between conversational (Vibe Mode) and spec-first (Spec Mode) workflows
Hive Mind Memory: Shared ChromaDB vector database for intelligent code context retrieval

Key Features

🚀 Revolutionary Token Optimization (98% Reduction)

The Problem: Stateless AI APIs require sending entire conversation history with every request, causing exponential token growth that makes long coding sessions prohibitively expensive.

Our Solution: Two-phase optimization system that reduces token usage by 98%:

Phase 1: Sliding Window

What it does: Keeps only last 5 conversation turns in history
Token savings: 80-95% reduction
How it works: Automatically trims old messages while preserving system prompt
Configurable: Adjust window size (3-20 turns) based on your needs

Phase 2: Local State Management

What it does: Stores working data locally instead of in chat history
Token savings: Additional 50-80% on top of Phase 1
How it works: AI automatically stores todos, file lists, and work items locally
Automatic: State injected into context when needed, persists across sessions

Real-World Impact:

Example: 100-turn coding session

WITHOUT optimization:
- Turn 1:    1,000 tokens
- Turn 50:   1,000,000 tokens  
- Turn 100:  2,000,000 tokens
- Cost:      $60 (at GPT-4 pricing)

WITH HiveTerminal:
- Turn 1:    1,000 tokens
- Turn 50:   10,000 tokens
- Turn 100:  20,000 tokens
- Cost:      $0.60 (at GPT-4 pricing)

SAVINGS: $59.40 (98% reduction)

Why This Matters:

✅ Long coding sessions become affordable
✅ No more hitting context limits
✅ Faster responses (less data to process)
✅ Works with any AI provider
✅ Completely automatic and transparent

See docs/STATE_MANAGEMENT_QUICK_START.md for details.

🔄 Dual-Mode Operation

Vibe Mode (Conversational)

Tool-by-tool execution with individual approvals
Iterative, flexible workflow
Best for exploration and quick tasks
Approve each action as the agent works

Spec Mode (Three-Phase)

Plan → Approve → Execute workflow
Single approval for complete plan
Structured, transparent approach
Best for complex, multi-step tasks

Why It's Unique: Most agentic coding tools force you into one workflow. HiveTerminal lets you choose based on the task, and switch anytime with /mode.

🧠 Hive Mind Memory System

Automatic Indexing: Code changes are automatically embedded and stored
Semantic Search: Retrieves relevant context for every request
Git-Shareable: Team members share knowledge through version control
ChromaDB-Powered: Fast, local vector database with no external dependencies

Why It's Unique: Unlike other tools that rely on simple file reading, HiveTerminal builds a semantic understanding of your codebase that your entire team can share through Git.

🌐 Multiple LLM Providers

HiveTerminal supports a wide range of AI providers - no vendor lock-in:

Xiaomi Mimo - Free 309B parameter model with 256K context (Recommended for coding)
OpenAI - GPT-4, GPT-4o, and other models
Anthropic - Claude models
OpenRouter - Access to Mistral, Llama, and more
Groq - Fast inference
GitHub Models - Free tier available
Google AI Studio - Gemini models
Hugging Face - Open-source models
Ollama - 100% local, offline models

Why It's Unique: Switch providers anytime without changing your workflow. Use free models for development, premium models for production.

🌐 Local-First with Ollama

100% Local: Run completely offline with Ollama
Privacy-Focused: Your code never leaves your machine
No API Costs: Free to use with local models
Multiple Models: Support for Qwen, Llama, Mistral, and more

Why It's Unique: True offline capability with full feature parity. Most "local" tools still require cloud services for key features.

⚡ Xiaomi Mimo v2 Flash (Recommended)

Free API: No cost during beta period
High Performance: 309B parameter MoE model with 15B active
256K Context: Massive context window for large codebases
#1 on SWE-bench: 73.4% score, best for coding tasks
Fast Inference: 150 tokens/sec
Get API Key: https://platform.xiaomimimo.com/#/console/api-keys

🎨 Familiar Interface

Maintains Vibe's complete UI and UX
Rich terminal formatting with syntax highlighting
Interactive prompts and progress indicators
No learning curve if you know Vibe

🎯 Competitive Advantages

vs. Cursor, Windsurf, Cline (VS Code Extensions)

✅ 98% cheaper for long sessions (token optimization)
✅ Terminal-native - no IDE dependency
✅ Dual-mode - conversational + spec-first
✅ True offline - works without internet
✅ Git-shareable memory - team knowledge base

vs. Aider (CLI Tool)

✅ 98% cheaper for long sessions (token optimization)
✅ Rich TUI - not just plain text
✅ Dual-mode - conversational + spec-first
✅ Shared memory - semantic code understanding
✅ State management - automatic data persistence

vs. Mistral Vibe (Our Foundation)

✅ Token optimization - 98% reduction
✅ Spec Mode - structured workflows
✅ Hive Mind - shared memory system
✅ Multi-provider - not just Mistral
✅ State management - local data storage
✅ 100% compatible - all Vibe features work

The Bottom Line

HiveTerminal is the only terminal-based agentic coding assistant that:

Solves the token explosion problem (98% reduction)
Offers dual-mode operation (conversational + spec-first)
Provides git-shareable team memory
Works 100% offline with full features
Supports 8+ AI providers with no lock-in

Cost Comparison (100-turn coding session):

Cursor/Windsurf/Cline/Aider: $60
HiveTerminal: $0.60
You save: $59.40 per session

Quick Start

One-Line Installation

macOS & Linux

curl -fsSL https://raw.githubusercontent.com/HyphaeAI/HiveTerminal/main/install.sh | bash

Windows (PowerShell)

irm https://raw.githubusercontent.com/HyphaeAI/HiveTerminal/main/install.ps1 | iex

What the installer does:

✅ Clone HiveTerminal
✅ Set up Python virtual environment
✅ Install all dependencies
✅ Add hive command to your PATH
✅ Run initial setup

After installation, you'll need to:

Install Ollama (for local AI models)
Download an AI model
Restart your terminal/PowerShell

Installing Ollama & AI Models

1. Install Ollama

macOS:

brew install ollama

Linux:

curl -fsSL https://ollama.ai/install.sh | sh

Windows:

Download from: https://ollama.ai/download/windows
Run the installer
Ollama will start automatically

Or visit: https://ollama.ai

2. Choose Your AI Model

For Best Results (Recommended):

# Best for coding with tool calling (7B parameters)
ollama pull qwen2.5-coder:7b        # 4.7GB download

# Alternative: Fast and capable
ollama pull deepseek-coder:6.7b     # 3.8GB download

For Faster/Smaller Systems:

# Balanced performance (3B parameters)
ollama pull qwen2.5:3b              # 1.9GB download

# Fastest but limited (1.5B parameters - may struggle with tools)
ollama pull qwen2.5-coder:1.5b      # 986MB download

⚠️ Important: Models smaller than 7B may have issues with tool calling (outputting raw JSON instead of proper responses). For best experience, use 7B+ models.

📖 Detailed model comparison: See docs/MODEL_GUIDE.md for comprehensive model selection guide.

3. Start HiveTerminal

macOS & Linux:

# Restart your terminal, then:
hive

# Follow the setup prompts:
# 1. Select "Ollama" as your provider
# 2. Enter your model name (e.g., "qwen2.5-coder:7b")
# 3. Choose your preferred mode

Windows:

# Restart PowerShell, then:
hive

# Follow the setup prompts:
# 1. Select "Ollama" as your provider
# 2. Enter your model name (e.g., "qwen2.5-coder:7b")
# 3. Choose your preferred mode

Model Comparison

Model	Size	RAM Needed	Tool Calling	Speed	Best For
qwen2.5-coder:7b	4.7GB	8GB+	✅ Excellent	Medium	Production use, complex tasks
deepseek-coder:6.7b	3.8GB	8GB+	✅ Excellent	Fast	Coding, fast responses
qwen2.5:3b	1.9GB	4GB+	⚠️ Limited	Fast	Simple tasks, testing
qwen2.5-coder:1.5b	986MB	2GB+	❌ Poor	Very Fast	Quick tests only

Manual Installation

macOS & Linux

# 1. Install Ollama
# macOS:
brew install ollama

# Linux:
curl -fsSL https://ollama.ai/install.sh | sh

# 2. Clone the repository
git clone https://github.com/HyphaeAI/HiveTerminal.git
cd hiveterminal

# 3. Create virtual environment (Python 3.10+ required, 3.13 recommended)
python3 -m venv .venv
source .venv/bin/activate  # On Mac/Linux

# 4. Install HiveTerminal and Vibe in editable mode
pip install -e .
pip install -e Vibe/

# 5. Download an AI model (choose based on your needs)
# Recommended for best results:
ollama pull qwen2.5-coder:7b        # 4.7GB - Best for coding

# Or alternatives:
ollama pull deepseek-coder:6.7b     # 3.8GB - Fast & capable
ollama pull qwen2.5:3b              # 1.9GB - Balanced
ollama pull qwen2.5-coder:1.5b      # 986MB - Fastest (limited)

# 6. Add to PATH (choose your shell config file)
echo 'export PATH="$HOME/hiveterminal:$PATH"' >> ~/.zshrc   # for zsh
# or
echo 'export PATH="$HOME/hiveterminal:$PATH"' >> ~/.bashrc  # for bash

# 7. Create alias for easy access
echo 'alias hive="cd ~/hiveterminal && source .venv/bin/activate && python -m hiveterminal.cli.entrypoint"' >> ~/.zshrc
# or for bash:
echo 'alias hive="cd ~/hiveterminal && source .venv/bin/activate && python -m hiveterminal.cli.entrypoint"' >> ~/.bashrc

# 8. Reload shell configuration
source ~/.zshrc  # or source ~/.bashrc

# 9. Run setup
hive --setup

Windows

Prerequisites:

Python 3.10+ (Download)
- During installation, check "Add Python to PATH"
Git for Windows (Download)
Ollama (Download)

Installation Steps:

# 1. Install Ollama
# Download and install from: https://ollama.ai/download/windows

# 2. Clone the repository
git clone https://github.com/HyphaeAI/HiveTerminal.git
cd hiveterminal

# 3. Create virtual environment
python -m venv .venv
.venv\Scripts\Activate.ps1

# 4. Install HiveTerminal and Vibe
pip install -e .
pip install -e Vibe\

# 5. Download an AI model
# Recommended for best results:
ollama pull qwen2.5-coder:7b        # 4.7GB - Best for coding

# Or alternatives:
ollama pull deepseek-coder:6.7b     # 3.8GB - Fast & capable
ollama pull qwen2.5:3b              # 1.9GB - Balanced
ollama pull qwen2.5-coder:1.5b      # 986MB - Fastest (limited)

# 6. Create wrapper script
@"
@echo off
cd /d "$PWD"
call .venv\Scripts\activate.bat
python -m hiveterminal.cli.entrypoint %*
"@ | Out-File -FilePath hive.bat -Encoding ASCII

# 7. Add to PATH (run as Administrator)
$currentPath = [Environment]::GetEnvironmentVariable("Path", "User")
[Environment]::SetEnvironmentVariable("Path", "$currentPath;$PWD", "User")

# 8. Restart PowerShell and run setup
hive --setup

First Run

# If you used the one-line installer, just type:
hive

# If you installed manually, activate the environment first:
cd ~/hiveterminal
source .venv/bin/activate
python -m hiveterminal.cli.entrypoint

# First time: You'll see onboarding to select your LLM provider
# Choose Ollama and enter your model name (e.g., qwen2.5-coder:7b)

# After setup, you'll see mode selection:
# 1. Conversational Mode (Flexible, Interactive)
# 2. Spec-First Mode (Structured, Transparent)

# Or specify mode directly:
hive --mode conversational  # Conversational mode
hive --mode spec            # Spec-first mode

# Run setup again anytime:
hive --setup

Adding to PATH (Manual Installation)

To use hive from anywhere, add it to your PATH:

For zsh (macOS default):

# Add to ~/.zshrc
echo 'alias hive="cd ~/hiveterminal && source .venv/bin/activate && python -m hiveterminal.cli.entrypoint"' >> ~/.zshrc
source ~/.zshrc

For bash (Linux default):

# Add to ~/.bashrc
echo 'alias hive="cd ~/hiveterminal && source .venv/bin/activate && python -m hiveterminal.cli.entrypoint"' >> ~/.bashrc
source ~/.bashrc

Alternative: Create a wrapper script

# Create wrapper script
cat > ~/hiveterminal/hive << 'EOF'
#!/bin/bash
cd ~/hiveterminal
source .venv/bin/activate
python -m hiveterminal.cli.entrypoint "$@"
EOF

# Make it executable
chmod +x ~/hiveterminal/hive

# Add to PATH
echo 'export PATH="$HOME/hiveterminal:$PATH"' >> ~/.zshrc  # or ~/.bashrc
source ~/.zshrc  # or source ~/.bashrc

Now you can run hive from any directory!

Usage Examples

Conversational Mode Example

You: Refactor the login function to use async/await

Agent: I'll read the file first
→ [Tool: read_file] → You approve

Agent: Now I'll make the changes
→ [Tool: write_file] → You approve

Agent: Let me run the tests
→ [Tool: bash] → You approve

✓ Done! Login function refactored.

Spec Mode Example

You: Refactor the login function to use async/await

Agent: Here's my execution plan:

# Execution Plan

## Summary
Refactor login.py to use async/await pattern

## Actions
1. Read login.py
2. Refactor function to async
3. Update all callers
4. Run test suite

## Files Affected
- login.py (Modify)
- auth_handler.py (Modify)

Do you want to proceed? (y/n)

You: y

Agent: Executing plan...
✓ Step 1/4: Read login.py
✓ Step 2/4: Refactored to async
✓ Step 3/4: Updated callers
✓ Step 4/4: Tests passing

✓ Done! All changes applied successfully.

Configuration

HiveTerminal uses a TOML configuration file at ~/.vibe/config.toml:

# Active model
active_model = "qwen2.5-coder:7b"

# Ollama Provider
[[providers]]
name = "ollama"
api_base = "http://localhost:11434/v1"
api_key_env_var = ""
backend = "generic"

# Model Configuration
[[models]]
name = "qwen2.5-coder:7b"
provider = "ollama"
alias = "qwen2.5-coder:7b"
temperature = 0.2
input_price = 0.0
output_price = 0.0

# Memory Configuration
[memory]
database_path = "./.hive_memory"
embedding_model = "text-embedding-ada-002"  # or "all-MiniLM-L6-v2" for local
chunk_size = 1000
top_k_results = 5

Note: For best results with tool calling, use models with 7B+ parameters. Smaller models (1.5B-3B) may struggle with complex tool interactions.

Memory System

How It Works

Ingestion: When you modify files, HiveTerminal automatically:
- Splits code into semantic chunks
- Generates embeddings
- Stores in local ChromaDB
Retrieval: For every request, HiveTerminal:
- Searches for relevant code context
- Injects top results into the agent's prompt
- Provides informed, context-aware responses
Sharing: The .hive_memory directory can be:
- Committed to Git
- Shared with team members
- Merged across branches

Memory Commands

# View memory statistics
hive --memory-stats

# Rebuild memory database
hive --rebuild-memory

CLI Reference

# Start interactive session
hive

# Run onboarding/setup
hive --setup

# Specify mode
hive --mode conversational  # Conversational mode
hive --mode spec            # Spec-first mode

# Memory management
hive --memory-stats         # Show memory statistics
hive --rebuild-memory       # Rebuild vector database

# Help
hive --help
hive --version

When to Use Each Mode

Use Vibe Mode (Conversational) When:

Exploring unfamiliar code
Making quick, iterative changes
Experimenting with different approaches
You want fine-grained control over each action
The task is simple or exploratory

Use Spec Mode (Three-Phase) When:

Making complex, multi-file changes
You want to review the complete plan first
Transparency is important
The task has multiple dependent steps
You want batch execution after approval

Switch Modes Anytime

# In any session, type:
/mode

# You'll see the mode selection prompt again

What Makes HiveTerminal Different?

The Token Optimization Breakthrough

Every other agentic coding tool has the same problem: They send the entire conversation history with every API request, causing exponential token growth.

HiveTerminal is the first to solve this with a two-phase optimization system:

Sliding Window (Phase 1): Keeps only recent conversation turns
Local State (Phase 2): Stores working data locally, not in history

Result: 98% token reduction, making long coding sessions affordable.

vs. Vibe (Our Foundation)

✅ Adds token optimization (98% reduction)
✅ Adds Spec Mode for structured workflows
✅ Adds Hive Mind memory system
✅ Adds multi-provider support (OpenAI, Anthropic, Ollama, etc.)
✅ Adds state management for data persistence
✅ Keeps 100% of Vibe's UI and features
✅ Maintains full compatibility with Vibe's tools and MCP

vs. Other AI Coding Assistants

Cursor, Windsurf, Cline (VS Code Extensions):

❌ No token optimization (expensive for long sessions)
❌ IDE-dependent (requires VS Code)
❌ Single workflow mode
❌ No shared memory system
❌ Limited offline support

Aider (CLI Tool):

❌ No token optimization (expensive for long sessions)
❌ Plain text interface (no rich TUI)
❌ Single workflow mode
❌ No semantic memory system
❌ No state management

HiveTerminal (This Project):

✅ 98% token reduction (affordable long sessions)
✅ Terminal-native with rich TUI
✅ Dual-mode operation (conversational + spec-first)
✅ Git-shareable semantic memory
✅ Automatic state management
✅ 100% offline capable
✅ 8+ AI providers supported
✅ No vendor lock-in

Why This Matters for Judges

Innovation: First terminal-based agentic coding assistant to solve the token explosion problem that plagues all stateless AI APIs.

Impact: Makes long coding sessions 98% cheaper, democratizing access to AI-powered development.

Technical Merit: Novel two-phase optimization (sliding window + local state) that's automatic and transparent.

Practical Value: Real-world testing showed 1M token reduction on a simple todo list task - from $30 to $0.60 per session.

Completeness: Not just a proof of concept - production-ready with comprehensive testing, documentation, and multi-platform support.

Architecture

Token Optimization Architecture

┌─────────────────────────────────────────────────────────┐
│                    User Message                         │
└────────────────────────┬────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────┐
│              State Manager (Phase 2)                    │
│  • Load session state from .hive_state/                │
│  • Inject state context into message                   │
│  • State: todos, file lists, work items, etc.         │
└────────────────────────┬────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────┐
│            Sliding Window (Phase 1)                     │
│  • Keep only last 5 conversation turns                 │
│  • Preserve system prompt                              │
│  • Trim old messages automatically                     │
└────────────────────────┬────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────┐
│                  AI Processing                          │
│  • Process with optimized context                      │
│  • Use state_tool to manage local data                │
│  • Generate response                                   │
└────────────────────────┬────────────────────────────────┘
                         │
┌────────────────────────▼────────────────────────────────┐
│              State Persistence                          │
│  • Save updated state to disk                          │
│  • Persist across sessions                             │
│  • Available for next turn                             │
└─────────────────────────────────────────────────────────┘

Token Usage Comparison:
┌──────────────────────────────────────────────────────────┐
│ WITHOUT Optimization (Turn 50):                          │
│ [System] [Turn 1] [Turn 2] ... [Turn 50] = 1,000,000 tokens │
│                                                          │
│ WITH Optimization (Turn 50):                            │
│ [System] [State Context] [Turn 46-50] = 10,000 tokens  │
│                                                          │
│ SAVINGS: 99% reduction                                  │
└──────────────────────────────────────────────────────────┘

System Architecture

┌─────────────────────────────────────────┐
│         CLI Layer (main.py)             │
│     --mode [vibe|spec] flag             │
└──────────────────┬──────────────────────┘
                   │
┌──────────────────▼──────────────────────┐
│      TUI Layer (Vibe's Textual UI)      │
│         (NO VISUAL CHANGES)             │
└──────────────────┬──────────────────────┘
                   │
            ┌──────┴──────┐
            │             │
┌───────────▼──┐    ┌─────▼────────────┐
│  Vibe Mode   │    │   Spec Mode      │
│(Conversational)   │ (Three-Phase)    │
└──────┬───────┘    └──────┬───────────┘
       │                   │
       └────────┬──────────┘
                │
┌───────────────▼──────────────────────┐
│     Shared Components (Vibe)         │
│  Tool Manager | Config | LLM Backend │
└───────────────┬──────────────────────┘
                │
┌───────────────▼──────────────────────┐
│   Memory Manager (ChromaDB)          │
│   - Code Ingestion & Chunking        │
│   - Semantic Retrieval               │
│   - Git-Shareable Hive Mind          │
└──────────────────────────────────────┘

Project Structure

hiveterminal/
├── core/           # Core functionality and LLM backend
├── cli/            # Command-line interface
├── memory/         # Memory management system
├── agents/         # Agent implementations
└── prompts/        # System prompts

.hive_memory/       # Vector database (created at runtime)
.hive_backups/      # File backups (created at runtime)
.hive_logs/         # Log files (created at runtime)
.hive_config.toml   # Configuration file (optional)

Requirements

All Platforms

Python 3.10+ (3.13 recommended for ChromaDB compatibility)
Git
Ollama (install from https://ollama.ai)

System Requirements by Model

Model	Download	RAM	Disk	Tool Calling
7B models	4-5GB	8GB+	10GB+	✅ Excellent
3B models	2GB	4GB+	5GB+	⚠️ Limited
1.5B models	1GB	2GB+	3GB+	❌ Poor

Platform-Specific

macOS:

macOS 10.15 (Catalina) or later
Homebrew (recommended for easy installation)

Linux:

Ubuntu 20.04+, Debian 11+, Fedora 35+, or equivalent
systemd (for Ollama service management)

Windows:

Windows 10 (version 1809) or later
Windows 11 (recommended)
PowerShell 5.1 or later (comes with Windows)
Administrator access (for PATH modification)

Choosing the Right Model

For Production Use

qwen2.5-coder:7b (4.7GB) - Best overall, excellent tool calling
deepseek-coder:6.7b (3.8GB) - Fast and capable alternative

For Development/Testing

qwen2.5:3b (1.9GB) - Good balance of speed and capability
qwen2.5-coder:1.5b (986MB) - Fast but limited, may struggle with tools

System Requirements by Model

Model	Download	RAM	Disk	Tool Calling
7B models	4-5GB	8GB+	10GB+	✅ Excellent
3B models	2GB	4GB+	5GB+	⚠️ Limited
1.5B models	1GB	2GB+	3GB+	❌ Poor

Note: Smaller models (< 7B) may output raw JSON instead of proper responses and struggle with complex tool interactions.

Troubleshooting

`hive: command not found` (macOS/Linux) or `'hive' is not recognized` (Windows)

Solution 1: Restart your terminal/PowerShell after installation

Solution 2: Manually source your shell config (macOS/Linux):

source ~/.zshrc  # for zsh
# or
source ~/.bashrc  # for bash

Solution 3 (Windows): Check if PATH was updated:

$env:Path -split ';' | Select-String "hiveterminal"

If not found, add manually:

# Run as Administrator
$currentPath = [Environment]::GetEnvironmentVariable("Path", "User")
[Environment]::SetEnvironmentVariable("Path", "$currentPath;$env:USERPROFILE\.hiveterminal", "User")

Solution 4: Check if the alias was added (macOS/Linux):

grep "hive" ~/.zshrc  # or ~/.bashrc

Ollama connection errors

Check if Ollama is running:

# macOS/Linux:
ollama list

# Windows:
ollama list

Start Ollama service:

macOS:

brew services start ollama

Linux:

ollama serve &

Windows:

Ollama runs as a background service automatically
Check system tray for Ollama icon
If not running, start from Start Menu: "Ollama"

Model not found (404 error)

List available models:

ollama list

Pull the model:

ollama pull qwen2.5-coder:7b

Make sure model name matches exactly (including tag):

# Wrong: qwen2.5-coder
# Right: qwen2.5-coder:7b

Small model issues (tool calling problems)

If using models smaller than 7B (like 1.5B or 3B), you may experience:

Raw JSON output instead of proper responses
Inappropriate tool calls
Confusion about when to use tools

Solution: Use a larger model (7B+ recommended):

ollama pull qwen2.5-coder:7b
hive --setup  # Select the new model

Python version issues

Check Python version:

# macOS/Linux:
python3 --version

# Windows:
python --version

Install Python 3.13 (recommended):

macOS:

brew install python@3.13

Linux (Ubuntu/Debian):

sudo apt update
sudo apt install python3.13 python3.13-venv

Windows:

Download from: https://www.python.org/downloads/
During installation, check "Add Python to PATH"
Restart PowerShell after installation

Windows-Specific Issues

PowerShell Execution Policy Error:

cannot be loaded because running scripts is disabled

Solution:

# Run as Administrator
Set-ExecutionPolicy -ExecutionPolicy RemoteSigned -Scope CurrentUser

Python not found after installation (Windows):

Solution: Add Python to PATH manually:

Search for "Environment Variables" in Start Menu
Click "Environment Variables"
Under "User variables", select "Path" and click "Edit"
Click "New" and add: C:\Users\YourUsername\AppData\Local\Programs\Python\Python313
Click "New" and add: C:\Users\YourUsername\AppData\Local\Programs\Python\Python313\Scripts
Click "OK" and restart PowerShell

Git not found (Windows):

Solution: Install Git for Windows:

Download from: https://git-scm.com/download/win
Use default installation options
Restart PowerShell after installation

Uninstalling HiveTerminal

If you need to completely remove HiveTerminal from your system:

One-Line Uninstall

macOS & Linux

curl -fsSL https://raw.githubusercontent.com/HyphaeAI/HiveTerminal/main/uninstall.sh | bash

Or if you have the repository:

cd ~/hiveterminal && ./uninstall.sh

Windows (PowerShell)

irm https://raw.githubusercontent.com/HyphaeAI/HiveTerminal/main/uninstall.ps1 | iex

Or if you have the repository:

cd ~\hiveterminal
.\uninstall.ps1

What Gets Removed

The uninstaller will remove:

✓ Installation directory (~/hiveterminal or ~/.hiveterminal)
✓ Configuration files (~/.vibe/)
✓ Environment files (~/.vibe/.env)
✓ Memory database (.hive_memory/)
✓ Backups and logs (.hive_backups/, .hive_logs/)
✓ Shell aliases and PATH entries
✓ Environment variables (XIAOMI_MIMO_API_KEY, etc.)
✓ Wrapper scripts (hive command)

Manual Uninstall

If you prefer to uninstall manually:

macOS & Linux Manual Uninstall

# 1. Remove installation directory
rm -rf ~/hiveterminal
# or
rm -rf ~/.hiveterminal

# 2. Remove configuration
rm -rf ~/.vibe

# 3. Remove memory database (if in project directory)
rm -rf .hive_memory .hive_backups .hive_logs

# 4. Remove shell aliases
# Edit ~/.zshrc or ~/.bashrc and remove lines containing:
# - hiveterminal
# - alias hive=
# - XIAOMI_MIMO_API_KEY

# 5. Reload shell
source ~/.zshrc  # or source ~/.bashrc

Windows Manual Uninstall

# 1. Remove installation directory
Remove-Item -Path "$env:USERPROFILE\hiveterminal" -Recurse -Force

# 2. Remove configuration
Remove-Item -Path "$env:USERPROFILE\.vibe" -Recurse -Force

# 3. Remove memory database (if in project directory)
Remove-Item -Path ".hive_memory" -Recurse -Force
Remove-Item -Path ".hive_backups" -Recurse -Force
Remove-Item -Path ".hive_logs" -Recurse -Force

# 4. Remove environment variables (Run as Administrator)
[Environment]::SetEnvironmentVariable("XIAOMI_MIMO_API_KEY", $null, "User")

# 5. Remove from PATH (Run as Administrator)
$currentPath = [Environment]::GetEnvironmentVariable("Path", "User")
$newPath = ($currentPath -split ';' | Where-Object { $_ -notlike "*hiveterminal*" }) -join ';'
[Environment]::SetEnvironmentVariable("Path", $newPath, "User")

# 6. Restart PowerShell

After Uninstall

After uninstalling:

Restart your terminal/PowerShell
Verify removal: hive --version should show "command not found"
Check that ~/.vibe/ directory is gone
Optionally remove Ollama if you don't need it:
- macOS: brew uninstall ollama
- Linux: sudo rm -rf /usr/local/bin/ollama
- Windows: Uninstall from Settings → Apps

Contributing

We welcome contributions! Please see CONTRIBUTING.md for guidelines.

License

[License information to be added]

Acknowledgments

Built on top of Mistral Vibe - a powerful terminal-based agentic coding assistant.

Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Documentation: Full Documentation

Ready to get started? Run hive and choose your workflow mode!

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
.hive_memory		.hive_memory
Vibe		Vibe
docs		docs
hiveterminal		hiveterminal
tests		tests
.gitignore		.gitignore
.hive_config.toml		.hive_config.toml
CHANGELOG.md		CHANGELOG.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
PROJECT_STRUCTURE.md		PROJECT_STRUCTURE.md
README.md		README.md
install.ps1		install.ps1
install.sh		install.sh
pyproject.toml		pyproject.toml
uninstall.ps1		uninstall.ps1
uninstall.sh		uninstall.sh

Folders and files

Latest commit

History

Repository files navigation

HiveTerminal

📑 Table of Contents

Quick Start

Features & Innovation

Usage & Configuration

Technical Details

Installation & Setup

Support

🏆 Why HiveTerminal Stands Out

The Problem We Solved

Our Solution: 98% Token Reduction

Phase 1: Sliding Window (80-95% reduction)

Phase 2: Local State Management (Additional 50-80% reduction)

What Makes HiveTerminal Unique

Key Innovations

What is HiveTerminal?

Key Features

🚀 Revolutionary Token Optimization (98% Reduction)

Phase 1: Sliding Window

Phase 2: Local State Management

🔄 Dual-Mode Operation

🧠 Hive Mind Memory System

🌐 Multiple LLM Providers

🌐 Local-First with Ollama

⚡ Xiaomi Mimo v2 Flash (Recommended)

🎨 Familiar Interface

🎯 Competitive Advantages

vs. Cursor, Windsurf, Cline (VS Code Extensions)

vs. Aider (CLI Tool)

vs. Mistral Vibe (Our Foundation)

The Bottom Line

Quick Start

One-Line Installation

macOS & Linux

Windows (PowerShell)

Installing Ollama & AI Models

1. Install Ollama

2. Choose Your AI Model

3. Start HiveTerminal

Model Comparison

Manual Installation

First Run

Adding to PATH (Manual Installation)

Usage Examples

Conversational Mode Example

Spec Mode Example

Configuration

Memory System

How It Works

Memory Commands

CLI Reference

When to Use Each Mode

Use Vibe Mode (Conversational) When:

Use Spec Mode (Three-Phase) When:

Switch Modes Anytime

What Makes HiveTerminal Different?

The Token Optimization Breakthrough

vs. Vibe (Our Foundation)

vs. Other AI Coding Assistants

Why This Matters for Judges

Architecture

Token Optimization Architecture

System Architecture

Project Structure

Requirements

All Platforms

System Requirements by Model

Platform-Specific

Choosing the Right Model

For Production Use

For Development/Testing

System Requirements by Model

Troubleshooting

hive: command not found (macOS/Linux) or 'hive' is not recognized (Windows)

Ollama connection errors

Model not found (404 error)

`hive: command not found` (macOS/Linux) or `'hive' is not recognized` (Windows)

Packages