
Hugind

Local-only LLM inference and agent system, written in Rust with llama.cpp as the inference engine. Aims to keep agents under control.

Server

OpenAI-compatible HTTP API backed by llama.cpp:

  • Continuous batching with stateful sessions
  • Context shifting (for supported models)
  • 3-tier KV cache: VRAM, RAM, disk (uses the fastest available; loading from disk can be faster than re-processing a large history)
  • Multimodal (image + text)
  • Embeddings
  • Authentication + streaming

Each model runs with its own config, OS process, and port. A model can have several configs depending on the task.
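Because the API is OpenAI-compatible, any OpenAI-style client can talk to a model's port. A minimal sketch using only the Python standard library; the base URL, port, and model name (`gemma-4b`) are examples taken from the quick start below, and come from your own config in practice:

```python
import json
import urllib.request

def build_chat_payload(model: str, messages: list[dict]) -> bytes:
    """Serialize an OpenAI-style chat completion request body."""
    return json.dumps({"model": model, "messages": messages}).encode()

def chat(base_url: str, model: str, messages: list[dict]) -> dict:
    """POST to the server's /v1/chat/completions endpoint."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=build_chat_payload(model, messages),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:
        return json.load(resp)

# Assumes a server started with `hugind server start gemma-4b`:
# reply = chat("http://localhost:8080", "gemma-4b",
#              [{"role": "user", "content": "who are you?"}])
# print(reply["choices"][0]["message"]["content"])
```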

Agent

Sandboxed agent execution with process isolation. Each agent runs in its own OS process.

Two runtimes:

  • JavaScript (rquickjs)
  • WASM (Wasmtime) for modules compiled from any language

Permission model (declared in agent.yaml):

  • Network: allowlist by domains/IPs, DNS-based rules, private network blocking
  • Filesystem: scoped read/write/create/delete, allowed/denied paths, WASM mounts
  • Shell: allowlist/blocklist, OS sandboxing (macOS sandbox-exec), timeouts, output limits
  • Resources (WASM): memory and CPU limits
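A sketch of what a permission declaration could look like. The field names below are illustrative assumptions, not the actual agent.yaml schema; check the repo's agent examples for the real keys:

```yaml
# Illustrative only -- key names are assumptions, not the real schema.
permissions:
  network:
    allow_domains: ["api.example.com"]   # domain/IP allowlist
    block_private: true                  # block private networks
  filesystem:
    read: ["./data"]                     # scoped read paths
    write: []                            # no write access
  shell:
    allow: ["git"]                       # command allowlist
    timeout_secs: 30
  resources:                             # WASM agents only
    memory_mb: 256
    cpu_ms: 5000
```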

Other:

  • MCP server support (declared per-agent under dependencies.mcp)
  • Agent install from git repos, local paths, or URLs (hugind agent install <path>)
  • Session logging under ~/.hugind/logs/agents/

Model

  • Download GGUF models from Hugging Face (with resume and SHA256 integrity verification)
  • Concurrent downloads
  • Local storage under ~/.hugind/models/{user}/{repo}/

Config

  • Hardware auto-detection (CPU, RAM, GPU)
  • Auto-fit support (engine adjusts context to fit available memory)
  • Hardware-aware defaults (GPU layers, threads, flash attention, KV offload)

Multi-Agent Orchestration

  • Task queue with dependency DAG (parallel execution of independent tasks)
  • Shared memory for cross-agent knowledge sharing
  • Inter-agent messaging (point-to-point + broadcast)
  • Coordinator pattern (auto-decompose goals into task graphs)
  • Agentic mode (LLM-driven tool-use loops)
  • Multi-model workflows (different agents use different Hugind servers)
  • Streaming events for progress tracking
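The dependency-DAG scheduling can be sketched conceptually: run every task whose dependencies have completed, in parallel, until the graph drains. This is an illustration of the pattern (level-by-level for simplicity), not Hugind's implementation:

```python
from concurrent.futures import ThreadPoolExecutor

def run_dag(tasks: dict[str, list[str]], work) -> list[str]:
    """Execute tasks level by level: each round runs every task whose
    dependencies are done, in parallel. `tasks` maps name -> dependencies."""
    done: set[str] = set()
    order: list[str] = []
    pending = dict(tasks)
    with ThreadPoolExecutor() as pool:
        while pending:
            ready = [n for n, deps in pending.items() if set(deps) <= done]
            if not ready:
                raise ValueError("dependency cycle in task graph")
            list(pool.map(work, ready))  # independent tasks run concurrently
            for n in ready:
                done.add(n)
                order.append(n)
                del pending[n]
    return order

# Shape of the team example in the quick start: review and test both
# wait on draft, then run in parallel.
steps = run_dag(
    {"draft": [], "review": ["draft"], "test": ["draft"]},
    lambda name: None,
)
```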

Stdio bridge

NDJSON + MCP protocol for desktop/app integration (hugind stdio).
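NDJSON framing itself is simple: one JSON object per line over stdin/stdout. A minimal sketch of the framing; the message fields here are placeholders, not Hugind's actual protocol messages:

```python
import json
from io import StringIO

def write_message(stream, msg: dict) -> None:
    """Write one NDJSON frame: a JSON object terminated by a newline."""
    stream.write(json.dumps(msg) + "\n")

def read_messages(stream):
    """Yield one dict per non-empty line of the stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)

# Round-trip through an in-memory stream (stdin/stdout in real use):
buf = StringIO()
write_message(buf, {"type": "request", "id": 1})    # placeholder fields
write_message(buf, {"type": "response", "id": 1})
buf.seek(0)
frames = list(read_messages(buf))
```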

Quick start

Install

brew install hugind
hugind --version

Set Hugging Face token (if needed)

hugind config defaults --hf-token <your_token>

Download a model

hugind model add google/gemma-3-4b-it-qat-q4_0-gguf

Create a config

hugind config init gemma-4b

The wizard will:

  • probe your hardware (CPU, RAM, GPU)
  • let you select a downloaded model
  • ask whether to enable auto-fit or choose a context size manually

Config is written to ~/.hugind/configs/gemma-4b.yml.

Start the server

hugind server start gemma-4b

Send a request

curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "gemma-4b",
    "messages": [{ "role": "user", "content": "who are you?" }]
  }'

See HTTP API for streaming, multimodal, sessions, embeddings, and state management.

Agent demo

hugind agent install https://github.com/netdur/hugind/tree/main/agent/audit

Requested permissions:
- Network access: No
- File access: Yes (actions: read; can access outside agent folder)
- Run system commands: No
> Grant these permissions and install this agent? Yes
Installed agent 'audit' to /Users/adel/.hugind/agents/audit

Run the audit agent against another agent's code:

hugind agent run audit agent/ocr

Checking server health at http://127.0.0.1:8080/v1/monitor...
Server is up. Starting agent...
{
  "Alignment": "PASS",
  "Security": "PASS",
  "Confidence": "high"
}

Run an agent with arguments:

hugind agent run agent/ocr --image /path/to/image.jpg --prompt "read the title"

{"blocks": [{"block_type": "text", "text": "...", "bbox_2d": [171, 133, 783, 223]}]}

Agent runs are logged:

[2026-02-10T12:29:33.526Z] agent.run.start name=ocr entry=main.js
[2026-02-10T12:29:33.530Z] host.fs.read_bytes path=/path/to/image.jpg
[2026-02-10T12:29:35.014Z] host.llm.chat_stream input=object messages=Some(1)
[2026-02-10T12:29:41.198Z] agent.run.complete status=ok

Multi-agent team

hugind agent team "Build a hello world python script" \
  --agents agent/ma-architect,agent/ma-developer,agent/ma-tester,agent/ma-reviewer

Team: ma-architect, ma-developer, ma-tester, ma-reviewer
Goal: Build a hello world python script

Decomposing goal into tasks...
Coordinator created 3 tasks:
  - Draft Hello World Script → ma-developer
  - Review Hello World Script → ma-reviewer (after: Draft Hello World Script)
  - Test Hello World Script → ma-tester (after: Draft Hello World Script)

  [task-0] Starting: Draft Hello World Script (ma-developer)
  [task-0] Completed: Draft Hello World Script
  [task-1] Starting: Review Hello World Script (ma-reviewer)
  [task-2] Starting: Test Hello World Script (ma-tester)
  [task-1] Completed: Review Hello World Script
  [task-2] Completed: Test Hello World Script

All tasks completed. Synthesizing result...

Agentic mode

Agents with mode: agentic register tools and a system prompt. The runtime drives the LLM tool-use loop automatically:

cd examples/hello-python
hugind agent run agent/ma-developer --prompt "Build a hello world python script"

# The agent writes the file in the current directory:
ls
hello_world.py

cat hello_world.py
print('Hello, World!')

Set HUGIND_TRACE=1 to enable detailed execution tracing.
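Conceptually, an agentic loop keeps calling the model, executes whatever tool call it requests, feeds the result back, and stops when the model returns a final answer. A toy sketch of that pattern; the fake model, the message shapes, and the `write_file` tool are stand-ins, not Hugind's runtime:

```python
def agentic_loop(model, tools: dict, prompt: str, max_steps: int = 8):
    """Drive a tool-use loop: the model either requests a tool call
    ({"tool": name, "args": ...}) or returns a final {"answer": ...}."""
    history = [{"role": "user", "content": prompt}]
    for _ in range(max_steps):
        reply = model(history)
        if "answer" in reply:
            return reply["answer"]
        result = tools[reply["tool"]](**reply["args"])  # run requested tool
        history.append({"role": "tool", "content": result})
    raise RuntimeError("tool loop did not converge")

# Toy stand-in model: requests one file write, then answers.
def fake_model(history):
    if not any(m["role"] == "tool" for m in history):
        return {"tool": "write_file", "args": {"name": "hello_world.py"}}
    return {"answer": "done"}

files: list[str] = []
out = agentic_loop(
    fake_model,
    {"write_file": lambda name: files.append(name) or "ok"},
    "Build a hello world python script",
)
```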

Docs

License

MIT