better-robotics · jonasneves · May 24, 2026 · May 24, 2026
diff --git a/.claude/CLAUDE.md b/.claude/CLAUDE.md
diff --git a/.claude/direction.md b/.claude/direction.md
@@ -1,229 +1,34 @@
-# Architectural direction — better-robotics
+# Direction
 
-Long-horizon shape decisions. Unlike `working.md` (tactical pending), this file names structural moves the project is committing to. Updated when the shape of the system changes.
+What we're committing to close for the Fall 2026 course pilot. Open exploration lives in `exploration.md`; positioning research in `field.md`.
 
-## 1. Generic typed-characteristic runtime (in flight)
+## Ranked gaps
 
-**Claim.** Every capability today exists in ~3 places (browser module, Pi
-handler, ESP32 handler). 80% of those files are boilerplate isomorphic to
-the capability's TYPE, not its identity. A generic runtime keyed on type
-eliminates the boilerplate.
+1. **Live parameters get/set.** Learning step 2 — "tune parameters live without editing code." Capabilities are already structured; add a param characteristic + getter/setter + a tuner row per capability card. Cheapest gap to close.
 
-**The data already exists.** `fw-info.caps` declares typed schemas:
+2. **Sensor/motor hot-plug auto-discovery.** Learning step 3 — "add physical components." Today capabilities are firmware-declared, not detected at runtime. The "smart breadboard" promise. Tractable on ESP32 with an i2c scan + a discovery characteristic.
 
-```json
-{ "name": "led",    "char": "…d92", "type": "toggle" }
-{ "name": "motors", "char": "…d99", "type": "signed-pair", "range": [-100, 100] }
-{ "name": "wifi",   "chars": {...}, "type": "wifi-scan" }
-{ "name": "ota",    "chars": {...}, "type": "bundle-ota" }
-{ "name": "camera", "chars": {...}, "type": "webrtc-installable" }
-{ "name": "ops",    "char": "…d9c", "type": "command" }
-```
+3. **Pub/sub vocabulary for messaging.** Learning step 5 and the explicit ROS2-transition story. Could be a topic layer over BLE notify, with MQTT as the "once WiFi is on" tier. Earns its keep only if the ROS2-prep framing stays.
 
-**The runtime (browser side).** A per-type constructor `makeXxxCap(schema)`
-returns `{probe, cleanup, renderSection, wireActions, postRender?}`. Adding
-a capability of a known type = one schema entry + zero JS code.
+4. **Discovery graph view.** Cheap once (3) lands. Makes pub/sub legible — pedagogical payoff for low effort.
 
-**Firmware-side direction (farther out).** Pi and ESP32 firmware have
-identical ceremony: register char, parse read/write, notify on change,
-gate on config. A "typed char runtime" on firmware reads the capability
-declaration and handles generic typed chars with a small driver binding
-per capability (`{ on_write: fn, on_read: fn }`).
+5. **Simulation hook.** A `MockRobot` so a student can iterate scripts without a working kit on the table. Classroom-critical; not everyone has hardware ready every session.
 
-**Progress so far:**
-- fw-info.caps carries the typed schema (shipped)
-- Browser reads + stores `entry.capSchema` (shipped)
-- Each capability module exports its own `schema` for cross-check (shipped)
-- **First type migrated: `toggle` → LED** (this session)
-- Future types to migrate: `signed-pair`, `wifi-scan`, `bundle-ota`,
-  `webrtc-installable`, `command`. Each is ~2–4 hours.
+## NFC tap-to-pair
 
-**Migration strategy.** Per-type, not per-capability. When we migrate
-`signed-pair`, both motors AND any future 2-axis input use the same
-runtime. The compound payoff is the Nth capability, not the first.
+The plan's original NFC role (handing the phone the puck's SoftAP creds) is dead post-BLE-first. Tags can still earn their keep as a *tap-to-pair-this-specific-robot* shortcut — collapses "scan → find robot-7 in a list of 12 → confirm" to a single tap.
 
-## 2. AI-maintained documentation (cheap, deferred)
+- **Tag content:** NDEF URL → `https://better-robotics.github.io/?pair=<robot-id>`. Dashboard reads `pair` from `location.search`, filters BLE scan to that device.
+- **Android Chrome:** tap → URL → filtered scan → confirm.
+- **iPhone:** iOS opens the URL but Web Bluetooth is unavailable. Workaround uses the existing phone↔desktop pair layer (`signal.neevs.io`, signed pair-request, `phone.html`): encode `phone.html?pair=<robot-id>`. Phone forwards `{type:"pair-robot", robotId}` over WebRTC; desktop surfaces a "Phone wants to pair robot-7 — click to confirm" banner. Desktop click is required because `navigator.bluetooth.requestDevice` needs a user gesture. Cross-network works for free.
+- **Bootstrap caveat:** first-ever use still needs the existing phone↔desktop pair ceremony.
 
-**Claim.** `README.md`, `HARDWARE.md`, `firmware/pi_robot/README.md`, and
-per-capability comments all describe what `fw-info.caps` + the code
-already know. They drift. An AI agent watching the schema + commit log
-can regenerate docs per release.
+~An afternoon to prototype. Concrete iOS+NFC demo defuses the "BLE-first leaves iPhones out" objection.
 
-**Scope.** ~2 days to wire a pre-commit generator plus a CI check that
-fails if docs aren't regenerated. Starts small: capability reference
-page auto-generated from the live schema. Expands to change-log
-summarization from commit messages.
+## Other gaps
 
-**Not urgent.** Doc drift isn't causing failures today. Worth doing
-when the project has contributors outside the core, or when we promise
-backward-compatibility guarantees that require accurate docs.
+- **Visual / block-based authoring tier.** Today: capability cards (drive motors, toggle LED) and `pip.ask` natural language — no block-editor surface for "when distance < 30cm, stop and turn right." XRP and MicroBlocks (see `.claude/field.md`) ship Blockly. Open question: do cards + Pip cover non-coder authoring, or is a drag-drop tier needed?
 
-## 3. Transparent-data-plane OTA (partially in flight)
+- **Inter-puck messaging.** Every message fans out through the browser as hub — no puck↔puck path. Tied to (3) but a separate architectural commitment.
 
-**Claim.** Every robot should have three OTA lanes with a clear fallback
-order. The dashboard picks the fastest available without user
-intervention. Iteration-loop speed is the core dev experience; "how fast
-does code get onto the robot" sets the tone for everything else.
-
-**The three lanes, decreasing friction:**
-
-1. **BLE-stream** — always works, no WiFi needed, no LAN co-location
-   required. Baseline for every robot on every network.
-   Today: `writeValueWithResponse` + ATT ack per 180-byte frame →
-   3-10 min for a 1.6 MB bin. Switching to
-   `writeValueWithoutResponse` + software flow control over
-   `ota-status` gets it to ~30 sec. **Not yet implemented.**
-
-2. **PNA direct to target robot** — dashboard fetches
-   `http://<robot-ip>/ota` straight from the browser. Chrome/Edge's
-   Private Network Access (shipped 2022) gates the first request on a
-   one-time user consent per origin. No TLS on the robot, no cert
-   ceremony, no crypto IRAM pressure. ~1 sec for a 1.6 MB bin over
-   LAN. Works whenever the dashboard and robot share a network.
-   **Not yet implemented on ESP32** (Pi doesn't need this lane — BLE
-   bundle OTA is already fast enough for Pi-sized updates).
-
-3. **Pi-as-gateway** — for multi-robot orchestration and offline-first
-   classroom deployments. Pi runs an `aioquic` WebTransport server
-   with a self-signed cert; dashboard uses `serverCertificateHashes`
-   pinning (cert sha256 published in Pi's fw-info) to connect
-   without PKI ceremony. Pi proxies raw TCP to the target ESP32 on
-   the LAN. Same ~1 sec speed as PNA direct, with bonus orchestration
-   surface (mesh multiple ESP32s, serve dashboard offline).
-   **Not yet implemented.** Earns its slot when multi-robot coord or
-   offline-first use cases land, not purely for OTA speed.
-
-**Why the three-lane shape is right:**
-- Lane 1 works on BLE only. No WiFi assumption.
-- Lane 2 works when browser and robot share a LAN. Most common case.
-- Lane 3 works when the fleet has a Pi (most Better Robotics fleets do).
-
-Dashboard tries fastest available, falls back automatically. User never
-picks a lane — it just updates as fast as the topology allows.
-
-**What's baked in vs what's not:**
-- BLE-stream as a baseline works today (for Pi bundle OTA; for ESP32
-  single-binary OTA, the WithResponse variant is live and slow).
-- ESP32 already runs a raw `WiFiServer` (for MJPEG) — adding a `/ota`
-  endpoint on the same task is near-zero new code on the firmware side.
-- Pi-as-gateway is purely additive to `pi_robot.py` — every Pi ships
-  with it, no opt-in, just one more capability.
-- Dashboard-side lane selection: not yet written. Attempts lanes in
-  order, falls back on timeout/error.
-
-**Sequencing:**
-1. BLE-WithoutResponse first (universal, smallest change).
-2. PNA + ESP32 `/ota` endpoint second (big bang for effort).
-3. Pi-as-gateway when its orchestration/offline story earns it.
-
-## 4. ESP32 build-as-a-service (bold, later)
-
-**Claim.** ESP32 firmware is purely deterministic from `{board, caps}`.
-Users currently install `arduino-cli` + core + toolchain to compile.
-If a service accepts a config and returns a signed `.bin`, the dashboard's
-"Flash firmware" button fetches a per-robot-config binary; no local dev
-environment is needed for adding capabilities.
-
-**Constraint.** The service has to be reliable enough that users aren't
-stuck if it's down. Either (a) same-origin build on GitHub Actions, or
-(b) a small hosted build service, or (c) in-browser compile via
-something like Wokwi's WebAssembly toolchain (the bold option).
-
-**The compound effect.** Combined with #1, adding an ESP32 capability
-becomes: declare schema, bind driver code in a capability driver DSL,
-click Flash. No C++, no toolchain, no linker flags.
-
-**Worth it when.** Project has contributors who want to add capabilities
-without learning the ESP32 toolchain. Today the audience is small enough
-that `make flash` is fine.
-
-## 5. Closed-loop visual control: draw-a-path (next, after overhead ArUco validates)
-
-**Claim.** Once an overhead camera + marker is established (the
-`aruco.js` work — overhead localization writing `entry.arucoPosition`
-per scan), the natural next layer is closed-loop control driven from
-that pose. Operator props the phone (or local webcam) overhead, finger-
-draws a path on the phone screen, the robot follows it. New sensor
-isn't needed; the pose primitive is already shipping.
-
-**The hard sub-problem isn't drawing or motor control — it's pose
-reliability.** Without knowing where the robot is each frame, the
-closed loop doesn't close and the robot drifts within seconds. The
-overhead ArUco surface gates this — until metric accuracy is
-validated against tape-measure ground truth (see `.claude/notes.md` →
-"Wired but unproven"), don't build the follower on top.
-
-**Right primitives, in order of load-bearing-ness:**
-- **Pose**: ArUco overhead, already shipped. Producer writes
-  `entry.arucoPosition`; consumer (this work) must gate on
-  `Date.now() - updatedAt` for staleness.
-- **Where compute lives**: dashboard runs detector + controller +
-  emits pulse-bounded BLE motor writes. Phone is I/O. Robot
-  unchanged. Same control-plane / data-plane split as everything else.
-- **Tech**: `js-aruco2` already in. Pure-pursuit controller in plain
-  JS (~50 lines).
-- **Control loop budget**: detect (~15 ms) + plan (~1 ms) + BLE
-  pulse (~50 ms) ≈ 70 ms / iter → ~14 Hz. Each iteration emits a
-  short pulse (`duration_ms ≈ 100 ms`); firmware watchdog auto-stops
-  if the next iter doesn't arrive. The existing pulse-bounded-motion
-  + watchdog invariants are the safety floor — same discipline as
-  Pip / user scripts.
-
-**Phases:**
-1. **Path source.** `<canvas>` overlay on `phone.html` viewfinder;
-   touch listeners build a stroke-point array; send over the existing
-   WebRTC data channel as a typed message
-   (`{type: "path", points: [[x, y], ...]}`). Dashboard receives and
-   renders on the helper's SVG overlay alongside marker outlines. No
-   motors yet.
-2. **Closed-loop follower.** Pure-pursuit drives the most-recent
-   path; pulse-bounded each iter; safety stops on marker-loss ≥ 1 s,
-   end-of-path, or tap-to-cancel from the phone.
-3. **Pip tool surface.** `get_robot_pose(robot_id)` returns
-   `{x, y, theta, confidence}` from `entry.arucoPosition`. Optional;
-   not on the MVP critical path.
-
-**Validation criterion.** Tape marker on a rover, prop a phone or
-webcam overhead, draw a curved path on the phone screen, watch the
-robot trace within ~5 cm of the line over 1-2 m. If shipping leaves
-the rover drifting off-line within seconds, or the loop falls below
-5 Hz on target hardware, the primitive isn't load-bearing — redesign
-before extending.
-
-**Scope honesty.** This flips part of CLAUDE.md's "Not spatially
-aware" stance: when an overhead camera + marker is present, the
-robot has a known 2D pose. Not SLAM, not depth — just fiducial-
-bounded planar pose. Do NOT update CLAUDE.md until phase 2 lands and
-the validation criterion passes; claiming a capability before it
-works is the worst kind of scope drift.
-
-**Failure modes to watch:**
-- **Marker lost.** Phone is hand-held — will shake, tilt, occlude.
-  > 1 s loss → safety stop. Not optional; this IS the safety story
-  for this loop.
-- **"Phone overhead" geometry assumption.** If held at angle, the
-  floor isn't co-planar with the image. For short paths and small
-  angles, marker pixel position is good enough as a proxy. Larger
-  paths earn a homography (4 known floor points OR phone IMU +
-  marker scale) — defer until path length actually demands it.
-- **Detector latency budget.** Open-vocab "drive toward the yellow
-  cup" routes through Claude vision (~1–2 s round-trip), too slow
-  for a 5 Hz loop. A reflex-tier open-vocab detector earns its way
-  only when that specific use case lands — until then the ArUco-
-  pose loop is self-contained and MediaPipe COCO handles closed-
-  vocab reflex needs.
-
-## What this list doesn't include
-
-These ideas were considered and rejected or deferred for specific reasons
-— recording them here so we don't re-rehash:
-
-- **Running without Linux on the Pi (bare-metal).** Loses Python, gpiozero,
-  systemd, apt. Not a simplification; a regression. The Pi being a real
-  computer is the feature.
-- **Replacing BLE GATT with a custom protocol.** GATT is a standard with
-  tooling, debuggers (`bluetoothctl`, `nRF Connect`), and cross-platform
-  support. Reinventing would be faster to design and slower forever.
-- **Making the dashboard a conversational (chat-only) UI.** Visual
-  feedback for video, logs, and pinout has better throughput than text.
-  The LLM-orchestrator direction adds chat alongside, doesn't replace.
+- **Coordinate frames + time sync.** Required for sensor fusion and any multi-puck localization. Nothing in the scaffold addresses them.