name

camera-claw

description

Security camera for your AI agent — sandbox, record, and monitor OpenClaw

version

2026.3.12

icon

assets/camera-claw-icon.png

entry

scripts/monitor.js

deploy

deploy.sh

requirements

docker	platforms
true	linux, macos, windows

parameters

name	label	type	default	description	group
auto_start	Auto Start	boolean	true	Start CameraClaw automatically when Aegis launches	Lifecycle

name	label	type	default	description	group
openclaw_version	OpenClaw Version	string	latest	Docker image tag or git ref for OpenClaw	Sandbox

name	label	type	options	default	description	group
recording_mode	Recording Mode	select	continuous, activity, manual	continuous	continuous = always record, activity = record on events, manual = user-triggered	Recording

name	label	type	min	max	default	description	group
clip_duration	Clip Duration (seconds)	number	30	600	300	Length of each recording clip. Clips stored in Aegis media directory.	Recording

name	label	type	options	default	description	group
snapshot_fps	Snapshot FPS	select	0.2	0.5	Periodic VNC snapshot rate. Lower = less CPU. Desktop changes slowly.	Recording

name	label	type	default	description	group
network_monitoring	Network Monitoring	boolean	true	Log all outbound network connections from OpenClaw	Monitoring

name	label	type	default	description	group
alert_unknown_connections	Alert: Unknown Connections	boolean	true	Flag connections to unrecognized IP addresses	Monitoring

name	label	type	min	max	default	description	group
screen_change_threshold	Screen Change Threshold (%)	number	5	80	20	Minimum % pixel change to trigger a screen_change event. Lower = more sensitive.	Monitoring

name	label	type	options	default	description	group
vlm_analysis	VLM Analysis	select	off, on_change, periodic	off	off = no VLM, on_change = on significant screen change, periodic = every vlm_interval seconds	Monitoring

name	label	type	min	max	default	description	group
vlm_interval	VLM Analysis Interval	number	5	300	60	Seconds between periodic VLM analyses (when vlm_analysis = periodic)	Monitoring

name	label	type	default	description	group
openclaw_config_dir	Config Directory	string	~/.openclaw	Path to OpenClaw config dir. Mounted into container.	OpenClaw

name	label	type	default	description	group	secret
openclaw_gateway_token	Gateway Token	string		Auth token for OpenClaw Control UI. Auto-generated if empty.	OpenClaw	true

name	label	type	min	max	default	description	group
openclaw_gateway_port	Gateway Port	number	1024	65535	18789	Host port for first OpenClaw instance. Additional instances auto-increment.	OpenClaw

name	label	type	options	default	description	group
openclaw_gateway_bind	Gateway Bind	select	loopback, lan	loopback	loopback = localhost only, lan = accessible on LAN	OpenClaw

name	label	type	options	default	description	group
api_key_source	API Key Source	select	auto, manual, custom	auto	auto = forward Aegis keys automatically, manual = configure inside OpenClaw, custom = use keys below	API Keys

name	label	type	default	description	group	secret
openai_api_key	OpenAI API Key	string		OpenAI API key (only used when source = custom)	API Keys	true

name	label	type	default	description	group	secret
anthropic_api_key	Anthropic API Key	string		Anthropic API key (only used when source = custom)	API Keys	true

capabilities

live_detection

script	description
scripts/monitor.js	Real-time monitoring and audit of OpenClaw agent activity

Camera Claw

A security camera for your AI agent.

Security cameras watch people. Camera Claw watches AI agents. You wouldn't let a stranger into your house without a security camera — why let an AI agent run on your machine without one?

What It Does

Camera Claw provides three layers:

The Room — A Docker sandbox with a virtual desktop (Xvfb + Chrome) for OpenClaw
The Camera — KasmVNC live view + periodic snapshots with metadata
The DVR — Snapshot timeline with agent logs, network events, and optional VLM analysis

Docker Architecture

Each OpenClaw instance runs in an isolated Docker stack with a virtual desktop:

Service	Image	Purpose
`openclaw-gateway`	`openclaw:local`	AI agent gateway + Control UI (port 18789)
`openclaw-cli`	Same image	CLI for onboarding, channel setup

Inside the container: Xvfb (virtual display :99) + Chrome + KasmVNC (integrated VNC server + web client on :6080).

Multi-Instance Support

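Each instance runs as a separate docker-compose stack with its own ports and config. The first instance uses the configured openclaw_gateway_port (default 18789); additional instances auto-increment from it.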
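
A minimal sketch of how per-instance host ports could be derived, assuming each additional instance simply offsets the base ports by its index. The helper name `portsForInstance` is hypothetical, and applying the same offset to the KasmVNC port is an assumption; the manifest only states that the gateway port auto-increments.

```javascript
// Hypothetical helper: derive host ports for the Nth instance (0-based).
// Assumes a plain +index offset from the base ports; the real allocation
// strategy may differ (for example, probing for free ports).
function portsForInstance(index, gatewayBase = 18789, kasmvncBase = 6080) {
  if (!Number.isInteger(index) || index < 0) {
    throw new RangeError('index must be a non-negative integer');
  }
  const gatewayPort = gatewayBase + index;
  const kasmvncPort = kasmvncBase + index;
  if (gatewayPort > 65535 || kasmvncPort > 65535) {
    throw new RangeError('port out of range');
  }
  return {
    gatewayPort,
    kasmvncPort,
    gatewayUrl: `http://localhost:${gatewayPort}`,
    kasmvncUrl: `http://localhost:${kasmvncPort}`,
  };
}
```

With the defaults, `portsForInstance(0)` yields the ports used throughout the examples in this document (18789 and 6080).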

Protocol

CameraClaw and Aegis communicate via JSON Lines over stdin/stdout: one JSON object per line.

All events emitted by CameraClaw on stdout reach the Aegis frontend via skill-response in skill-runtime-manager.cjs. The frontend filters by skillId === 'camera-claw' and dispatches to the appropriate handler.

CameraClaw can request Aegis services (LLM, VLM, system info) via the inline query protocol — no direct HTTP connection needed.
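
The framing described above can be sketched as a pair of small helpers: one that serializes a message as a single line, and one that reassembles lines from arbitrary stdin chunks. The names `encodeLine` and `createLineDecoder` are illustrative and not taken from scripts/monitor.js; silently skipping malformed lines is also an assumption.

```javascript
// Serialize one message as a single newline-terminated JSON line.
function encodeLine(message) {
  return JSON.stringify(message) + '\n';
}

// Incremental decoder: feed it raw text chunks, get back parsed messages.
// A partial trailing line is buffered until its newline arrives.
function createLineDecoder() {
  let buffer = '';
  return function decode(chunk) {
    buffer += chunk;
    const lines = buffer.split('\n');
    buffer = lines.pop(); // last element is the incomplete remainder
    const messages = [];
    for (const line of lines) {
      const trimmed = line.trim();
      if (trimmed === '') continue;
      try {
        messages.push(JSON.parse(trimmed));
      } catch (err) {
        // Malformed input is skipped in this sketch; a real skill might log it.
      }
    }
    return messages;
  };
}
```

In a running skill, the decoder would typically be fed from `process.stdin` `data` events (after `process.stdin.setEncoding('utf8')`), and events would be sent with `process.stdout.write(encodeLine({...}))`.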

CameraClaw → Aegis (stdout events)

Lifecycle Events

{"event":"ready", "mode":"docker", "openclaw_version":"latest", "monitoring":true}
{"event":"instance_started", "instance_id":"default", "gateway_url":"http://localhost:18789", "kasmvnc_url":"http://localhost:6080", "token":"abc123...", "name":"Default Agent"}
{"event":"instance_stopped", "instance_id":"default", "reason":"user_request"}
{"event":"error", "message":"Docker daemon not running", "retriable":false}

Desktop Monitoring Events

{"event":"vnc_ready", "instance_id":"default", "kasmvnc_url":"http://localhost:6080", "view_only_url":"http://localhost:6080/?viewOnly=true"}
{"event":"snapshot", "instance_id":"default", "path":"/abs/path/snap_001.jpg", "ts":"2026-03-11T14:00:05Z", "screen_diff_pct":42.3}
{"event":"screen_change", "instance_id":"default", "diff_pct":42.3, "snapshot_path":"/abs/path/snap_002.jpg", "ts":"2026-03-11T14:00:07Z"}
{"event":"activity_summary", "instance_id":"default", "status":"active", "ts":"2026-03-11T14:00:10Z", "vlm_summary":"Agent is composing a tweet about AI developments", "vlm_safety":"ok"}
{"event":"idle", "instance_id":"default", "idle_since":"2026-03-11T14:10:00Z", "idle_seconds":120}

Network & Console Events

{"event":"console", "instance_id":"default", "stream":"stdout", "line":"Agent started task: browse twitter", "ts":"..."}
{"event":"network", "instance_id":"default", "remote_ip":"104.244.42.1", "domain":"twitter.com", "remote_port":443, "direction":"outbound", "ts":"..."}
{"event":"alert", "instance_id":"default", "type":"unknown_connection", "detail":"Connection to 185.43.210.1:8080", "ts":"..."}
{"event":"health", "instance_id":"default", "cpu_percent":12.3, "memory_mb":256, "uptime_seconds":3600, "ts":"..."}
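
As an illustration of how a network event might become an alert, here is a minimal sketch assuming "unknown" means that neither the domain nor the IP appears in a set of known hosts. The function `alertForConnection` and the `knownHosts` set are hypothetical; the source does not specify how CameraClaw decides that a connection is unrecognized.

```javascript
// Hypothetical allowlist check: turn a network event into an alert event
// when neither its domain nor its IP is in the known-hosts set.
function alertForConnection(networkEvent, knownHosts) {
  const { remote_ip, remote_port, domain, instance_id, ts } = networkEvent;
  if ((domain && knownHosts.has(domain)) || knownHosts.has(remote_ip)) {
    return null; // recognized destination: no alert
  }
  return {
    event: 'alert',
    instance_id,
    type: 'unknown_connection',
    detail: `Connection to ${remote_ip}:${remote_port}`,
    ts,
  };
}
```

The returned object mirrors the shape of the alert event shown above, so it could be written to stdout unchanged.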

Inline Queries (request Aegis services)

CameraClaw can request VLM analysis from Aegis without direct HTTP:

{"query":"vlm_chat", "id":1, "messages":[{"role":"user","content":[{"type":"image_url","image_url":{"url":"data:image/jpeg;base64,..."}},{"type":"text","text":"Describe what the AI agent is doing on screen. Note any concerns."}]}], "max_tokens":256}

Aegis responds on stdin:

{"response":1, "ok":true, "content":"Agent is browsing twitter.com/home, scrolling through the feed. No concerns.", "model":"gemma-3-4b", "usage":{"prompt_tokens":800,"completion_tokens":45}}
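
The request/response pairing above relies on the `id` in the query matching the `response` field in the reply. A minimal sketch of that correlation follows, assuming a `write` callback that sends one line to stdout. The names `createQueryClient`, `handleMessage`, and the `error` field on failed responses are assumptions; the source only shows a successful reply.

```javascript
// Hypothetical query client: correlates {"query":...,"id":N} requests with
// {"response":N,...} replies arriving on stdin.
function createQueryClient(write) {
  let nextId = 1;
  const pending = new Map(); // id -> { resolve, reject }

  function query(name, params) {
    const id = nextId++;
    const promise = new Promise((resolve, reject) => {
      pending.set(id, { resolve, reject });
    });
    write(JSON.stringify({ query: name, id, ...params }) + '\n');
    return promise;
  }

  // Call for every stdin message; returns true if it was a query response.
  function handleMessage(message) {
    if (!message || typeof message.response !== 'number') return false;
    const entry = pending.get(message.response);
    if (!entry) return true; // unknown or duplicate id: nothing to settle
    pending.delete(message.response);
    if (message.ok) entry.resolve(message);
    else entry.reject(new Error(message.error || 'query failed')); // `error` field is assumed
    return true;
  }

  return { query, handleMessage, pendingCount: () => pending.size };
}
```

A caller would then write something like `const reply = await client.query('vlm_chat', { messages, max_tokens: 256 })` and read `reply.content`.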

Aegis → CameraClaw (stdin commands)

Instance Management

{"command":"create_instance", "instance_id":"work", "name":"Work Agent"}
{"command":"stop_instance", "instance_id":"work"}
{"command":"list_instances"}
{"command":"stop"}

Recording Control

{"command":"pause_recording", "instance_id":"default"}
{"command":"resume_recording", "instance_id":"default"}
{"command":"take_snapshot", "instance_id":"default"}

Desktop Interaction

{"command":"analyze_screen", "instance_id":"default"}

Triggers an immediate VLM analysis of the current screen. CameraClaw captures a snapshot, sends a vlm_chat query to Aegis, and emits an activity_summary event with the result.

Non-command messages

Messages without a command field (e.g. detection frame events from other skills) are silently ignored.
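
The command handling described in this section can be sketched as a small dispatcher that routes by the `command` field and drops everything else. The name `createDispatcher` and the returned status object are illustrative; how the real skill treats unrecognized command names is not specified, so this sketch simply reports them as unhandled.

```javascript
// Hypothetical dispatcher: routes stdin messages to handlers by `command`.
// Messages with no `command` field are ignored, as described above.
function createDispatcher(handlers) {
  return function dispatch(message) {
    if (!message || typeof message.command !== 'string') {
      return { handled: false, reason: 'not_a_command' };
    }
    // Own-property check so names like "constructor" never hit the prototype.
    const known = Object.prototype.hasOwnProperty.call(handlers, message.command);
    if (!known || typeof handlers[message.command] !== 'function') {
      return { handled: false, reason: 'unknown_command' };
    }
    handlers[message.command](message);
    return { handled: true };
  };
}
```

A skill would register one handler per command listed above (`create_instance`, `stop_instance`, `take_snapshot`, and so on) and pass every decoded stdin message through `dispatch`.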

Aegis Frontend Integration

Monitor View (Camera Grid)

The OpenClaw desktop appears as a camera tile using KasmVNC in view-only mode:

// Frontend: embed KasmVNC iframe with viewOnly=true for monitor tile
const iframe = document.createElement('iframe');
iframe.src = viewOnlyUrl;  // http://localhost:6080/?viewOnly=true
iframe.style.cssText = 'width:100%;height:100%;border:none';
tileElement.appendChild(iframe);

Live desktop stream, scaled to thumbnail
Click tile → opens OpenClaw Panel (switches to interactive)
Motion indicator when screen_change events arrive

OpenClaw Panel (Sidebar)

Full interactive KasmVNC session:

// Frontend: embed KasmVNC iframe with full interaction for panel
const iframe = document.createElement('iframe');
iframe.src = kasmvncUrl;  // http://localhost:6080 (interactive)
iframe.style.cssText = 'width:100%;height:100%;border:none';
iframe.allow = 'clipboard-read; clipboard-write';
panelElement.appendChild(iframe);

Full mouse/keyboard control
Clipboard sharing enabled
Used for onboarding, configuration, manual intervention

Recording Pipeline

CameraClaw handles recording internally:

Periodic snapshots at snapshot_fps rate (default 0.5 fps)
Screen diff between consecutive snapshots
If diff > screen_change_threshold → emit screen_change event
Metadata enrichment: each snapshot paired with agent logs + network events
VLM analysis (if enabled): triggered on significant changes or periodically
Storage: snapshots + JSONL metadata in ~/.aegis-ai/media/camera-claw/<instance_id>/
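
The diff-and-threshold steps above can be sketched as follows, assuming both snapshots have already been decoded into raw RGBA buffers of equal size (JPEG decoding is out of scope here). The function names, the per-channel `tolerance`, and the "percentage of changed pixels" metric are assumptions; the source does not say how CameraClaw computes its diff.

```javascript
// Percentage of pixels that differ between two equally sized RGBA frames.
// `tolerance` is a hypothetical per-channel noise floor (0-255).
function screenDiffPct(prev, curr, tolerance = 16) {
  if (prev.length !== curr.length || prev.length % 4 !== 0) {
    throw new Error('frames must be RGBA buffers of equal size');
  }
  const totalPixels = prev.length / 4;
  if (totalPixels === 0) return 0;
  let changed = 0;
  for (let i = 0; i < prev.length; i += 4) {
    // Compare R, G, B; alpha is ignored.
    if (
      Math.abs(prev[i] - curr[i]) > tolerance ||
      Math.abs(prev[i + 1] - curr[i + 1]) > tolerance ||
      Math.abs(prev[i + 2] - curr[i + 2]) > tolerance
    ) {
      changed++;
    }
  }
  return (changed / totalPixels) * 100;
}

// Build a screen_change event when the diff exceeds the threshold, else null.
function maybeScreenChange(instanceId, diffPct, snapshotPath, thresholdPct = 20,
                           ts = new Date().toISOString()) {
  if (diffPct <= thresholdPct) return null;
  return {
    event: 'screen_change',
    instance_id: instanceId,
    diff_pct: diffPct,
    snapshot_path: snapshotPath,
    ts,
  };
}
```

The default `thresholdPct` of 20 matches the default of screen_change_threshold in the parameter table.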

Snapshot Timeline Format

~/.aegis-ai/media/camera-claw/default/
├── 2026-03-11/
│   ├── snaps/
│   │   ├── 14-00-01.jpg
│   │   ├── 14-00-03.jpg
│   │   └── 14-00-05.jpg
│   └── timeline.jsonl      ← enriched metadata per snapshot

Each line in timeline.jsonl:

{"ts":"2026-03-11T14:00:01Z", "snap":"snaps/14-00-01.jpg", "diff_pct":0, "agent_log":"Browsing feed", "network":[{"domain":"twitter.com","bytes":45200}]}
{"ts":"2026-03-11T14:00:05Z", "snap":"snaps/14-00-05.jpg", "diff_pct":42.3, "agent_log":"Composing tweet", "vlm":"Agent composing tweet about AI", "vlm_safety":"ok"}
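
For reviewing a day's timeline, a minimal sketch of reading these lines back and filtering for entries worth a look. The helper names and the review rule (large diff or a `vlm_safety` value other than "ok") are assumptions; the source documents only the "ok" value.

```javascript
// Parse the text of a timeline.jsonl file into entry objects.
function parseTimeline(text) {
  return text
    .split('\n')
    .map((line) => line.trim())
    .filter((line) => line !== '')
    .map((line) => JSON.parse(line));
}

// Entries worth reviewing: large screen changes or a non-"ok" VLM verdict.
function entriesToReview(entries, minDiffPct = 20) {
  return entries.filter(
    (e) =>
      e.diff_pct >= minDiffPct ||
      (e.vlm_safety !== undefined && e.vlm_safety !== 'ok')
  );
}
```

The input text could come from `fs.readFileSync(path, 'utf8')` on a timeline.jsonl file under the media directory shown above.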

Installation

./deploy.sh    # Node.js deps + Docker image build (with KasmVNC) + config dir setup

Checks for Node.js ≥18
Runs npm install
Verifies Docker and Docker Compose
Creates ~/.openclaw/ config directory
Builds OpenClaw Docker image (with KasmVNC + desktop packages)
Validates docker-compose.yml
