Prompd · sbaker · Apr 19, 2026 · Apr 19, 2026
diff --git a/docs/AIDEN-PLAN.md → docs/PROMPD-AI-PLAN.md b/docs/AIDEN-PLAN.md → docs/PROMPD-AI-PLAN.md
@@ -1,9 +1,9 @@
-# AIDEN — AI Dev EN
+# Prompd AI — on-device local runtime
 
-On-device AI assistant bundled alongside Prompd for internal generative features
-(workflow scaffolding, file generation, explain/fix/rewrite, frontmatter suggestions).
-
-Name is tentative: AIDEN = "AI Dev EN". Previously referred to as "AI Fox."
+On-device AI runtime bundled alongside Prompd for internal generative features
+(workflow scaffolding, file generation, explain/fix/rewrite, frontmatter
+suggestions). Exposed via `prompd ai` CLI and a local OpenAI-compatible HTTP
+endpoint.
 
 ---
 
@@ -12,16 +12,16 @@ Name is tentative: AIDEN = "AI Dev EN". Previously referred to as "AI Fox."
 - **Phase:** 0 — Planning (this document)
 - **Last updated:** 2026-04-19
 - **Blockers:** none
-- **Next action:** confirm plan with Stephen, pick prototype platform, start Phase 1
+- **Next action:** start Phase 1 in `prompd-cli`
 
 ---
 
 ## Objective
 
-Ship a local inference runtime that Prompd owns end-to-end, so AIDEN features
-work with zero per-user cloud cost, zero required API keys, and offline-capable.
-Runtime is shared across the Prompd ecosystem (desktop app, CLI, future VS Code
-extension) via a local OpenAI-compatible HTTP endpoint.
+Ship a local inference runtime that Prompd owns end-to-end, so internal AI
+features work with zero per-user cloud cost, zero required API keys, and
+offline-capable. Runtime is shared across the Prompd ecosystem (desktop app,
+CLI, future VS Code extension) via a local OpenAI-compatible HTTP endpoint.
 
 ---
 
@@ -30,74 +30,90 @@ extension) via a local OpenAI-compatible HTTP endpoint.
 - **Not** a substitute for cloud providers (OpenAI / Anthropic / Groq) on
   user-authored prmd executions. Those continue to use the existing provider
   abstraction in `@prompd/core` with the user's own keys.
-- **Not** a general-purpose LLM host. AIDEN is scoped to Prompd's internal
+- **Not** a general-purpose LLM host. Prompd AI is scoped to Prompd's internal
   generative features only (scaffolding, explain, fix, generate, rewrite).
 - **Not** a managed cloud service. Vertex AI path was evaluated and rejected
   (cost: ~$3,780/mo/replica at the Model Garden one-click default).
 - **Not** a required install. Fully opt-in; Prompd's base install and cloud-
-  provider features work without AIDEN.
+  provider features work without it.
 
 ---
 
 ## Architecture — Prompd-wide local daemon
 
 ```
-~/.prompd/aiden/
+~/.prompd/ai/
 ├── bin/              runtime binary (llama-server, platform-specific)
 ├── models/           weight files (content-addressed by SHA)
 ├── adapters/         LoRA adapters (prmd-tuned variants)
 ├── config.json       port, default model, mem limits, auth token
-├── aiden.pid         running daemon PID
-└── aiden.lock        discovery file (port, pid, model, started-at)
+├── daemon.pid        running daemon PID
+└── daemon.lock       discovery file (port, pid, model, started-at)
 ```
 
 - **Runtime:** vendored `llama-server` from llama.cpp (~15MB per platform).
   Already speaks OpenAI-compatible HTTP — drops into `@prompd/core`'s provider
   abstraction as just another endpoint.
-- **Lifecycle owner:** `prompd-cli` — five-verb CLI surface:
-  - `prompd aiden install` — hardware probe, download binary, download weights,
-    write config. Idempotent. Re-runs handle model swaps and updates via
-    `--model <name>` (e.g., `--model e4b`, `--model prompd/aiden-prmd@0.2.0`).
-  - `prompd aiden uninstall` — wipe `~/.prompd/aiden/` entirely.
-  - `prompd aiden start` — launch daemon on local port.
-  - `prompd aiden stop` — graceful shutdown.
-  - `prompd aiden status` — running state, model loaded, version, memory, port,
-    installed models list.
-  - Interactive by default in a terminal; `-y` / `--yes` for headless/scripted
-    runs (also used by the desktop app when it triggers lifecycle via IPC).
+- **Lifecycle owner:** `prompd-cli` — five-verb CLI surface. Interactive by
+  default in a terminal; `-y` / `--yes` for headless/scripted runs (also used
+  by the desktop app when it triggers lifecycle via IPC).
+
+  | Command | Behavior |
+  |---|---|
+  | `prompd ai install --model <name>` | Download + add to catalog. **Additive** — multiple models can be installed side-by-side. Idempotent (re-runs are no-ops). |
+  | `prompd ai uninstall --model <name>` | Remove one model from the catalog. |
+  | `prompd ai uninstall` | Wipe `~/.prompd/ai/` entirely. `-y` to skip confirm. |
+  | `prompd ai start` | Launch daemon serving the default model. |
+  | `prompd ai start --model <name>` | Launch daemon serving a specific installed model. |
+  | `prompd ai stop` | Graceful shutdown. |
+  | `prompd ai status` | Running state, currently serving model, version, memory, port, full installed-models catalog. |
+
 - **Transport:** HTTP on `127.0.0.1:<port>` only (never binds to a public
   interface). Optional bearer token in `config.json` to prevent other local
   apps from snooping.
-- **Discovery:** fixed default port with `aiden.lock` fallback so consumers
+- **Discovery:** fixed default port with `daemon.lock` fallback so consumers
   (desktop app, CLI, extension) find the running daemon reliably even if the
   port was overridden.
 - **Process model:** auto-start with desktop app, stop on app quit. CLI users
-  control explicitly via `prompd aiden start/stop`.
+  control explicitly via `prompd ai start` / `stop`.
+
+---
+
+## Model naming convention
+
+| Pattern | Example | Meaning |
+|---|---|---|
+| `<family>-<size>` | `gemma-4-e4b` | Base upstream model |
+| `<family>-<size>-<quant>` | `gemma-4-e4b-q4` | Quantization-specific tag |
+| `<namespace>-<variant>@<version>` | `prompd-prmd@0.0.1` | Prompd-published tuned variant |
+
+`prompd-prmd@0.0.1` is the first planned Prompd-tuned variant — Gemma 4 E4B
+LoRA-tuned on the prmd corpus.
 
 ---
 
 ## Component map (cross-repo)
 
 | Repo | What it owns |
 |---|---|
-| `prompd-cli` (typescript) | `aiden` subcommand, lifecycle, model catalog, download/verify, hardware probe, `aiden.lock` management |
+| `prompd-cli` (typescript) | `ai` subcommand, lifecycle, model catalog, download/verify, hardware probe, `daemon.lock` management |
 | `prompd-app` (this repo) | Settings → AI panel, enable/disable flow, status bar indicator, IPC to CLI for lifecycle, AI feature call sites |
-| `@prompd/core` | New `aiden` provider that targets `127.0.0.1:<port>` using existing OpenAI-compat client |
-| `registry.prompdhub.ai` | Hosts AIDEN weight manifests + prmd-tuned adapter releases (CDN-backed) |
+| `@prompd/core` | New `prompd-ai` provider that targets `127.0.0.1:<port>` using existing OpenAI-compat client |
+| `registry.prompdhub.ai` | Hosts weight manifests + prmd-tuned adapter releases (CDN-backed) |
 
 ---
 
 ## UX flows
 
 ### Opt-in
 
-1. User clicks **Enable AIDEN** (Settings → AI, or CTA on first AI-feature click).
+1. User clicks **Enable Prompd AI** (Settings → AI, or CTA on first AI-feature click).
 2. Modal: value prop + disk/RAM requirements.
 3. Hardware probe: RAM ≥ 6GB free, disk ≥ 3GB, GPU detected (Metal / CUDA / CPU).
 4. Pick model variant (defaulted by hardware): E2B Q4 for low-spec, E4B Q4 recommended.
 5. Download binary + weights with progress, pause/resume.
-6. `prompd aiden start` fires; daemon binds `127.0.0.1:<port>`.
-7. Status bar shows "AIDEN ready." AI features now work locally.
+6. `prompd ai start` fires; daemon binds `127.0.0.1:<port>`.
+7. Status bar shows "Prompd AI ready." AI features now work locally.
 
 ### Opt-out
 
@@ -108,7 +124,7 @@ extension) via a local OpenAI-compatible HTTP endpoint.
 
 ### Uninstall
 
-- Settings → AI → **Remove AIDEN** wipes `~/.prompd/aiden/`.
+- Settings → AI → **Remove Prompd AI** wipes `~/.prompd/ai/`.
 - Uninstalling Prompd itself prompts to keep or delete the directory (user may
   reinstall and want tuned adapters preserved).
 
@@ -123,7 +139,7 @@ extension) via a local OpenAI-compatible HTTP endpoint.
 | Linux/Windows + NVIDIA | ~2.7GB (+CUDA runtime ~150MB) | ~4–6GB VRAM preferred |
 | Prompd-tuned variant | +100–200MB LoRA on top of base | negligible added |
 
-Runtime is 0 RAM when app is closed or AIDEN is explicitly stopped.
+Runtime is 0 RAM when app is closed or daemon is explicitly stopped.
 
 ---
 
@@ -137,14 +153,14 @@ Goal: prove the runtime + CLI wrapper feel right on one platform.
 - Implement the five-verb CLI surface (`install / uninstall / start / stop /
   status`) in `prompd-cli`, with `-y` headless flag
 - Hardcode Gemma 4 E4B Q4 manifest for `install` in Phase 1
-- Smoke-test from the desktop app via an `aiden` provider in `@prompd/core`
+- Smoke-test from the desktop app via a `prompd-ai` provider in `@prompd/core`
 - Decision gate: does the boot + first-token feel acceptable?
 
 ### Phase 2 — Platform coverage (1–2 weeks)
 
 - Add Linux (CPU + CUDA) and Windows (CPU + CUDA) binary bundles
 - Hardware probe (RAM / disk / GPU detection) cross-platform
-- Graceful shutdown, PID management, port discovery via `aiden.lock`
+- Graceful shutdown, PID management, port discovery via `daemon.lock`
 - Download progress, resume, integrity verification (SHA manifest)
 
 ### Phase 3 — Desktop UX (1 week)
@@ -158,7 +174,8 @@ Goal: prove the runtime + CLI wrapper feel right on one platform.
 
 - Assemble prmd corpus (public prmd files, docs, format spec, examples)
 - LoRA-tune Gemma 4 E4B on one-off rented L4 (~$20 one-time)
-- Publish adapter to registry, wire `prompd aiden model pull prompd/aiden-prmd`
+- Publish adapter to registry as `prompd-prmd@0.0.1`
+- Wire `prompd ai install --model prompd-prmd@0.0.1`
 
 ### Phase 5 — Polish & release
 
@@ -181,12 +198,12 @@ All decisions below are **resolved** unless marked open.
   on the managed list; E2B/E4B aren't. Fine-tuning a serverless model isn't
   possible either (tuned variants require dedicated endpoints), which kills the
   "train it on prmd material" requirement.
-- **2026-04-19 — Use case is AIDEN internal only.** Not for executing user-
-  authored prompts. That path continues to use cloud providers via the existing
-  abstraction.
+- **2026-04-19 — Use case is internal generative features only.** Not for
+  executing user-authored prompts. That path continues to use cloud providers
+  via the existing abstraction.
 - **2026-04-19 — Scope is Prompd-wide, not desktop-only.** Runtime lives in
-  `~/.prompd/aiden/` and is managed by `prompd-cli`. Desktop app, CLI, and
-  future consumers all connect to the same local endpoint.
+  `~/.prompd/ai/` and is managed by `prompd-cli`. Desktop app, CLI, and future
+  consumers all connect to the same local endpoint.
 - **2026-04-19 — Engine: vendor `llama-server`, not Ollama, not custom.**
   `llama-server` is ~15MB, speaks OpenAI-compat natively, rides llama.cpp
   upstream improvements for free. Ollama was rejected for bundle size and
@@ -198,11 +215,19 @@ All decisions below are **resolved** unless marked open.
   spec).** Effective 4B is the sweet spot for structured codegen on prmd's
   known schema. 26B unnecessary for this use case.
 - **2026-04-19 — Opt-in only, never forced.** Base install stays lean;
-  AIDEN-dependent features are CTA'd, not hidden vs. shown TBD.
-- **2026-04-19 — CLI surface locked at five verbs.** `install / uninstall /
-  start / stop / status`. No separate `model pull` / `update` verbs — `install`
-  is idempotent and handles model swaps / updates via `--model <name>`.
-  Interactive by default; `-y` / `--yes` for headless.
+  dependent features are CTA'd, not hidden vs. shown TBD.
+- **2026-04-19 — CLI name is `prompd ai` (descriptive, not branded).**
+  "AIDEN" / "AI Fox" codenames were considered and rejected — the "Aiden" space
+  is crowded in the AI-assistant market (aiden.solutions, aidenai.com, Olark's
+  Aiden, App Store Aiden, Twitter-acquired org at github.com/aiden). Plain
+  `prompd ai` is impossible to trademark-collide on, immediately intuitive in
+  `--help` output, and frees the brand layer to evolve independently of the
+  CLI surface. Commands: `install / uninstall / start / stop / status`,
+  `-y` for headless, `--model` selects which model to install or serve.
+- **2026-04-19 — `install` is additive, not a swap.** Multiple models can live
+  in the catalog side-by-side. `start --model <name>` picks which one to
+  serve. `uninstall --model <name>` removes one; bare `uninstall` wipes
+  everything.
 
 ---
 
@@ -213,8 +238,8 @@ All decisions below are **resolved** unless marked open.
 - **Default port number** — pick a stable Prompd-reserved port (like Ollama's
   11434). Needs to be unused, documented, and overridable.
 - **Windows service vs. app-lifecycle daemon only** — Windows users might want
-  AIDEN always-on for CLI/VS Code extension usage. Systemd/launchd/service
-  installation is out of scope for Phase 1.
+  the daemon always-on for CLI/VS Code extension usage. Systemd/launchd/
+  service installation is out of scope for Phase 1.
 - **Weight hosting location** — registry CDN, HuggingFace Hub, or GCS? Leaning
   registry-hosted for brand and control, with HuggingFace mirror for community
   discovery.
@@ -236,8 +261,10 @@ All decisions below are **resolved** unless marked open.
   is mandatory. Consider torrent-style distribution later if scale warrants.
 - **Windows + NVIDIA install pain.** CUDA driver version mismatches are a
   support headache. Ship with a clear "CPU fallback if GPU init fails" path.
-- **Branding: "AIDEN"** name collision search not done yet. Needs a quick IP
-  check before any public mention.
+- **LocalAI name-space overlap.** An existing open-source project
+  (github.com/mudler/LocalAI) does a similar self-hosted OpenAI-compat server.
+  Never market Prompd AI with language that reads as "Local AI" as a product
+  name — use "Prompd AI" or "on-device AI" consistently.
 
 ---