SafeClaw -- AI Agent Context

Security-first personal AI coding assistant with zero-trust architecture.

What This Project Is

SafeClaw is an AI coding assistant with multi-provider LLM support (GitHub Copilot, OpenAI, Anthropic), mandatory OS-level sandboxing, capability-based access control, encrypted secret storage, and signed skill verification. Every tool execution is sandboxed via Linux kernel features (Landlock, seccomp-BPF, namespaces). There is no way to disable security enforcement -- it is structural.

The target user is an individual developer who wants AI-assisted coding with strong guarantees against prompt injection, malicious tool calls, and data exfiltration.

Linux and macOS. Node.js >= 22. pnpm 9+.

Repository Structure

safeclaw/
├── packages/           # pnpm monorepo workspace packages
│   ├── vault/          # @safeclaw/vault -- AES-256-GCM encrypted key-value store
│   ├── sandbox/        # @safeclaw/sandbox -- OS-level process sandboxing wrapper
│   ├── core/           # @safeclaw/core -- agent runtime, capabilities, tools, sessions, copilot client
│   ├── gateway/        # @safeclaw/gateway -- HTTP server with auth + rate limiting
│   ├── webchat/        # @safeclaw/webchat -- browser chat SPA + static file server
│   └── cli/            # @safeclaw/cli -- CLI entry point (top of dependency tree)
├── native/             # C11 sandbox helper binary (musl-gcc, statically linked)
├── skills/             # Builtin skill manifests (Ed25519-signed)
├── test/               # Cross-cutting security tests
├── docs/               # Architecture, security model, sandboxing, skills docs
│   └── plans/          # Design documents and implementation plans
└── scripts/            # Build/release scripts

Package Dependency Graph

vault (standalone)     sandbox (standalone)
      \                    |
       \                   v
        +-----> core <-----+
               / | \
              /  |  \
             v   v   v
        gateway webchat cli (depends on all)

Key Architectural Concepts

Agent Loop

packages/core/src/agent/agent.ts -- Multi-round tool-calling loop. Sends messages to the configured model provider, receives tool call requests, executes them through the ToolOrchestrator, feeds results back. Continues until the model produces a final text response.

Model Providers

packages/core/src/providers/types.ts -- ModelProvider interface (common chat() and chatStream() methods)
packages/core/src/providers/copilot.ts -- CopilotProvider wraps existing CopilotClient
packages/core/src/providers/openai.ts -- OpenAIProvider uses native fetch against OpenAI API
packages/core/src/providers/anthropic.ts -- AnthropicProvider translates OpenAI wire format to/from Anthropic Messages API
packages/core/src/providers/registry.ts -- ProviderRegistry manages available providers
Provider selection is vault-driven: vault.get("provider") returns "copilot" (default), "openai", or "anthropic"

Capability System

packages/core/src/capabilities/registry.ts -- Tracks which skills have which capabilities
packages/core/src/capabilities/enforcer.ts -- Checks every tool call against granted capabilities at runtime
packages/core/src/capabilities/verifier.ts -- Ed25519 signature verification for skill manifests
8 capability types: fs:read, fs:write, net:http, net:https, process:spawn, env:read, secret:read, secret:write
Capabilities have constraints (e.g., allowed paths, allowed hosts)

Tool Orchestrator

packages/core/src/tools/orchestrator.ts -- Central tool execution pipeline:

Capability check (enforcer)
Sandbox execution (if available) or direct execution
Audit logging (timestamp, duration, result, sandbox status)

Builtin Tools

Located in packages/core/src/tools/builtin/:

read.ts, write.ts, edit.ts -- File operations
bash.ts -- Shell command execution; createBashTool(options?) factory accepts allowedCommandPaths for advisory command validation (warns when a binary is not under an allowed directory; Landlock is the real enforcement)
web-fetch.ts -- HTTP fetching
web-search.ts -- Web search via Brave Search API (conditionally included when brave_api_key exists in vault)
process.ts -- Background process management (start/status/log/kill/list)
apply-patch.ts -- Multi-file unified diff patching with atomic writes and fuzzy matching (parser in patch-parser.ts, applier in patch-applier.ts)

Each tool declares requiredCapabilities and implements a ToolHandler interface.

ProcessManager

packages/core/src/tools/process-manager.ts -- Tracks spawned child processes by UUID. Features: ring buffer output capture (1MB max per process), automatic cleanup after 1 hour, maximum 8 concurrent processes. Used by the process builtin tool.

Sandbox

packages/sandbox/src/sandbox.ts -- Wraps commands via @anthropic-ai/sandbox-runtime (SandboxManager.wrapWithSandbox()) as the outer layer; injects the C helper as the inner process via --policy-file <tmp> when found
packages/sandbox/src/policy-builder.ts -- PolicyBuilder class with fluent API; PolicyBuilder.forDevelopment(cwd, options?) creates a development-ready policy with allowlisted system paths, compiler toolchains (JVM, GCC), an expanded ~120 syscall allowlist, and support for extraExecutePaths/extraReadWritePaths via DevelopmentPolicyOptions; PolicyBuilder.toRuntimeConfig(policy) translates SandboxPolicy to SandboxRuntimeConfig for sandbox-runtime (write allowlist + credential dir denylist)
native/src/main.c -- C helper binary that applies: Landlock filesystem rules, seccomp-BPF syscall filtering, capability dropping, PR_SET_NO_NEW_PRIVS
Policy sent to helper via --policy-file <tmp> (JSON written to a temp file at mode 0o600; cleaned up after each execution)

Vault

packages/vault/src/vault.ts -- AES-256-GCM encrypted JSON file store. Keys derived via scrypt from passphrase or fetched from OS keyring (GNOME secret-tool). File permissions enforced at 0o600.

Channel Adapters

packages/core/src/channels/types.ts defines ChannelAdapter interface (connect, disconnect, onMessage, send). Two implementations:

packages/cli/src/adapter.ts -- readline-based terminal
packages/webchat/src/adapter.ts -- HTTP/SSE-based browser SPA

Gateway

packages/gateway/src/server.ts -- HTTP server with:

Bearer token auth (timing-safe comparison, min 32 chars)
Token bucket rate limiting per client IP
Single endpoint: POST /api/chat
Localhost-only binding

Entry Point and Bootstrap

The main entry point is packages/cli/src/cli.ts (registered as safeclaw binary).

Bootstrap flow (packages/cli/src/commands/bootstrap.ts):

Open vault (keyring or passphrase)
Read provider config from vault (provider key, defaults to "copilot")
Create appropriate ModelProvider (CopilotProvider, OpenAIProvider, or AnthropicProvider)
Load builtin skill manifest
Read brave_api_key from vault; if present, include web_search tool in tool registry
Create ProcessManager for background process tracking
Initialize SandboxManager network proxy (via PolicyBuilder.toRuntimeConfig())
Create: CapabilityRegistry -> CapabilityEnforcer -> ToolRegistry -> Sandbox -> ToolOrchestrator -> ContextCompactor -> Agent
Return { agent, sessionManager, capabilityRegistry, auditLog }

CLI commands: chat (default), onboard, audit, serve/server, doctor, help, version

Technology Stack

Aspect	Choice
Language	TypeScript (strict, ES2024 target)
Runtime	Node.js >= 22
Modules	ESM (`"type": "module"`, `"module": "Node16"`)
Package manager	pnpm 9+ with workspaces
Build	`tsc` with project references (composite builds)
Tests	Vitest 4.x with v8 coverage
Linter	OxLint 1.50+
Native code	C11, statically linked with musl-gcc
LLM API	Multi-provider: GitHub Copilot (device flow OAuth), OpenAI, Anthropic
Crypto	Node.js `crypto` -- AES-256-GCM, scrypt, Ed25519
CI	GitHub Actions

Development Commands

pnpm install          # Install dependencies
pnpm build            # Build all packages (tsc --build)
pnpm test             # Run all tests (vitest run)
pnpm lint             # Lint with oxlint
pnpm typecheck        # Type-check without emitting (tsc --build --dry)
pnpm bundle           # Create release bundle

# Native sandbox helper
make -C native        # Build (requires musl-tools)
make -C native check  # Run native tests

Testing Patterns

Co-located tests: *.test.ts files next to source files
Dependency injection: All external dependencies are injectable via constructor/function parameters for testability
Mocking: vi.mock() for module mocking, vi.fn() for function mocks
Security tests: Dedicated test/security/ directory with sandbox-escape, permission-escalation, crypto-validation, and auth-bypass tests
Native tests: Shell scripts + compiled C test binaries in native/test/
Vitest config: Module aliases map @safeclaw/* to source files (vitest.config.ts)

Data Storage

No database -- all runtime state is in-memory (audit log, capability registry)
Vault: JSON file on disk (~/.safeclaw/vault.json), AES-256-GCM encrypted, 0o600 permissions
Config: ~/.safeclaw/ directory (global user config)
Sessions: persisted to <cwd>/.safeclaw/sessions/ (directory-scoped, per-project)

Git Policy

Never commit or push changes. When work is complete (or at a logical stopping point), stop and provide ready-to-run git commit commands for all changes made during the session. If changes span multiple logical units, provide multiple commit commands in sequence so each commit is atomic and well-scoped. Always list every file touched and group them by commit. Example:

Ready to commit. Run:

git add packages/sandbox/src/types.ts packages/sandbox/src/detect.ts packages/sandbox/src/detect.test.ts && \
git commit -m "feat(sandbox): extend EnforcementLayers and KernelCapabilities types for bwrap"

git add packages/sandbox/src/policy-builder.ts packages/sandbox/src/policy-builder.test.ts && \
git commit -m "feat(sandbox): add toBwrapArgs() and selective home directory binding"

git add docs/sandboxing.md docs/security-model.md README.md AGENTS.md && \
git commit -m "docs: update documentation for bubblewrap sandbox integration"

If all changes are a single logical unit, a single commit is fine:

Ready to commit. Run:

git add -A && git commit -m "feat(tools): add JSON Schema parameters to all builtin tools"

The user will review and run the commands themselves. Do not run git add, git commit, git push, or any other git write operation.

Conventions

Commit style: Conventional Commits -- type(scope): description (e.g., feat(core): add agent runtime)
Scopes: core, cli, sandbox, vault, native, gateway, webchat, security, skills, tools, ci
Error handling: Fail-closed (deny by default), custom error classes per domain
Exports: Each package has src/index.ts barrel file re-exporting public API
No classes for data: Use TypeScript interfaces/types for data shapes, classes for stateful components
Security principle: Zero-trust, mandatory enforcement, no opt-out
Documentation updates: Whenever a feature is added, modified, or removed, update all relevant documentation files (README.md, AGENTS.md, docs/architecture.md, docs/getting-started.md, etc.) in the same changeset. Never leave documentation out of sync with the implementation.
Lint errors: All lint errors and warnings must be fixed before considering work complete. The GitHub CI workflow runs pnpm lint and will fail the build on any lint diagnostic. Never leave lint warnings as "pre-existing" or "to be ignored" -- fix them immediately.

Important Files to Read First

When getting oriented with this codebase, read these files in order:

docs/architecture.md -- Full system architecture with diagrams
docs/security-model.md -- Security philosophy and threat model
packages/core/src/agent/agent.ts -- The central agent loop
packages/core/src/tools/orchestrator.ts -- How tool calls flow
packages/core/src/capabilities/enforcer.ts -- How capabilities are checked
packages/cli/src/commands/bootstrap.ts -- How everything gets wired together
packages/sandbox/src/sandbox.ts -- How sandboxing works
native/src/main.c -- The native sandbox helper entry point

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SafeClaw -- AI Agent Context

What This Project Is

Repository Structure

Package Dependency Graph

Key Architectural Concepts

Agent Loop

Model Providers

Capability System

Tool Orchestrator

Builtin Tools

ProcessManager

Sandbox

Vault

Channel Adapters

Gateway

Entry Point and Bootstrap

Technology Stack

Development Commands

Testing Patterns

Data Storage

Git Policy

Conventions

Important Files to Read First

FilesExpand file tree

AGENTS.md

Latest commit

History

AGENTS.md

File metadata and controls

SafeClaw -- AI Agent Context

What This Project Is

Repository Structure

Package Dependency Graph

Key Architectural Concepts

Agent Loop

Model Providers

Capability System

Tool Orchestrator

Builtin Tools

ProcessManager

Sandbox

Vault

Channel Adapters

Gateway

Entry Point and Bootstrap

Technology Stack

Development Commands

Testing Patterns

Data Storage

Git Policy

Conventions

Important Files to Read First