Meta-agent sidebar

StackUnderflow ships a right-docked meta-agent sidebar that lets you talk to a local LLM about your own AI coding history — sessions, projects, costs, file touches, past decisions. The model gets a set of read-only backend tools that query the same SQLite store the dashboard reads, so its answers are grounded in your real data.

TL;DR. Run a tool-capable model under Ollama (e.g. qwen2.5-coder or llama3.2). Open StackUnderflow. Ask "what did I spend this month?", "have I dealt with stripe webhooks before?", "who touched src/auth/middleware.py?". The sidebar's LLM calls a backend tool, the route executes it against your local store, and the model answers in its own words. Nothing leaves your machine.

The sidebar is the chat sidebar — there is one chat surface, not two. For Ollama setup and how models are discovered, see docs/chat.md.

What it is

The sidebar has three parts:

A persistent right-side column in the dashboard layout. On wide viewports (>= 1280px) it's expanded by default; on tablet widths (>= 768px) it collapses to a thin icon rail; below that it hides, and the header chat button summons it as an overlay. The expanded/collapsed state persists in localStorage.
A streaming chat surface that talks to POST /api/meta-agent/chat. That route calls your local Ollama instance at http://localhost:11434/api/chat — the same hop the /ollama-api proxy uses.
A backend tool catalogue: thirteen typed read-only tools wrapping discovery, playback, project-summary, cost-rollup, recommendation and PR/CI services. The route hands the catalogue to Ollama on every request; if the model emits a tool_calls array, the route executes each call against the local store, appends the results as role: "tool" messages, and loops up to a 5-hop cap.

Privacy

Everything stays on your machine. The route only opens HTTP connections to localhost:11434 (Ollama). The tool executors only read from ~/.stackunderflow/store.db. There is no fallback to a remote LLM — if Ollama isn't running, the first NDJSON event the route emits is a {"type": "error"} line and the sidebar surfaces a banner. To verify, run lsof -nP -p $(pgrep -f stackunderflow); the only chat-related outbound socket is the one to your local Ollama port.

Which Ollama models support tool-calling

Ollama's /api/chat honours a tools: [...] array only when the model itself was trained on the function-calling shape. Recommended models for the meta-agent, all pullable with ollama pull <name>:

Model	Notes
`qwen2.5-coder:7b`	Strong coding-style tool selection for its size. The recommended default.
`llama3.2`	Smaller and faster; honours tools cleanly. Good for laptops without a discrete GPU.
`llama3.1:8b-instruct`	Strong general-purpose; slightly slower than 3.2 on the same hardware.
`firefunction-v2`	Heavyweight, trained explicitly for function calling — the most reliable tool selector, but large.

The sidebar's heuristic (modelLikelySupportsTools in services/metaAgent.ts) recognises the qwen2.5, llama3.1/llama3.2, firefunction, command-r, mistral-nemo/mistral-large and mixtral families. Pick a model outside those and you get an amber "Tool-calling may not work with X" banner above the composer. The chat still works — the model just won't call tools, so you get general-knowledge answers without grounding from your store.

Tool catalogue

Every tool is a thin wrapper over an existing read-only service or a direct SQL rollup. Each result is capped at ~4 KB of text (_RESULT_CHAR_BUDGET) so a noisy payload can't blow the model's context window. GET /api/meta-agent/tools returns the catalogue if you want to introspect it.

Tool	Backs onto	What it answers
`search_past_decisions(query, limit?, project?, since?)`	`services.discovery.search_past_decisions`	"Have I dealt with X before?"
`find_sessions_in_path(path, since?, limit?)`	`services.discovery.find_sessions_in_path`	"Show me sessions in this directory."
`find_sessions_touching_file(file, mode?, limit?)`	`services.discovery.find_sessions_touching_file`	"Who touched this file?"
`get_project_summary(slug?)`	direct SQL on `projects` + `sessions` + `messages`	"What's the state of this project?"
`get_cost_summary(period?, limit?)`	`reports.aggregate.build_report`	"What did I spend this month?"
`get_session_playback(session_id, at?)`	`services.playback_fs.reconstruct_fs_at`	"What did the agent change in session X?"
`recommend_mode(prompt, current_model?)`	`services.mode_recommender.recommend`	"Could a cheaper model handle this task?"
`get_burn_projection()`	`services.burn.build_projection` + `services.plans.compute_usage`	"Will I overrun this month?"
`list_recent_sessions(project?, limit?)`	direct SQL on `sessions` + `projects`	"What did I work on lately?"
`recommend_skills(project?, threshold?, window_days?)`	`services.skill_recommender.recommend_skills`	"What should I automate in this project?"
`get_pr_outcomes(repo, state?, since?, limit?)`	direct SQL on `pr_outcomes`	"What PRs landed in repo X?"
`get_ci_runs(commit_sha?, status?, repo?, limit?)`	direct SQL on `ci_runs`	"Did CI pass on commit X?"
`get_file_risk(path, since?)`	`services.risk.file_risk_summary`	"Have I broken this file before?"

get_project_summary and recommend_skills accept an optional slug / project; when omitted they fall back to the project you're currently viewing in the dashboard (the route passes it as current_slug). get_pr_outcomes and get_ci_runs read PR and CI rows from webhook/REST ingest — empty until you've ingested any.

When a tool returns over the budget, the result is trimmed — the longest list-typed fields are halved repeatedly — and a _truncated: true marker tells the model the response was cut.

Wire format

POST /api/meta-agent/chat accepts {messages, model, tools_enabled, project_slug?} and returns application/x-ndjson — one JSON object per line, each carrying a type discriminator:

{"type": "token", "delta": "you have ", "ts": "2026-05-15T12:01:33Z"}
{"type": "tool_call", "id": "call_1", "name": "list_recent_sessions", "args": {"limit": 5}, "ts": "..."}
{"type": "tool_result", "id": "call_1", "name": "list_recent_sessions", "ok": true, "data": {...}, "duration_ms": 12, "ts": "..."}
{"type": "error", "message": "Ollama not reachable", "ts": "..."}
{"type": "done", "hops": 2, "ts": "..."}

token — a chunk of the final assistant text.
tool_call / tool_result — paired by id; rendered as collapsed <details> blocks in the sidebar.
error — terminal; the loop bailed (Ollama down, bad request, malformed response).
done — terminal; the model produced a content-only turn. hops records how many round-trips to Ollama it took.

The maximum number of tool-call hops in one user turn is 5 (MAX_TOOL_HOPS in stackunderflow.services.meta_agent). A model that keeps emitting tool calls without ever producing a content turn hits this cap, and the route emits a single error event explaining what happened.

Frontend surfaces

Sidebar: components/layout/MetaAgentSidebar.tsx — the column itself (rail / docked / overlay modes); persistence helpers in metaAgentSidebarHelpers.ts.
Chat surface: components/discussion/MetaAgentInterface.tsx — model selector, session manager, message list, composer.
Tool-call surface: components/discussion/ToolCallSurface.tsx — the inline collapsed <details> block. Pure helpers (buildToolStatusLabel, buildToolSummary) live in toolCallSurfaceHelpers.ts for unit testing.
Service client: services/metaAgent.ts — the NDJSON stream parser, listTools(), and the modelLikelySupportsTools() heuristic.

Cookbook

A few questions the meta-agent handles well:

"What's the cheapest project I've worked on this month?" → calls get_cost_summary(period="month") and reads top_projects[-1].
"Did I ever fix a flaky test in tests/stackunderflow/etl/?" → calls find_sessions_in_path(path="tests/stackunderflow/etl", since="180d") and summarises matches.
"Show me what the agent changed in session ab123..." → calls get_session_playback(session_id="ab123...") and lists the touched files.
"Have I ever dealt with stripe webhooks before?" → calls search_past_decisions(query="stripe webhooks") and surfaces snippets.
"Will I overrun my plan this month?" → calls get_burn_projection() and quotes the projected month-end / days-to-limit / alert.
"Is cost.py a risky file to edit?" → calls get_file_risk(path="cost.py") and reports the revert / failure / success counts.

Extending the catalogue

A new tool needs three additions:

Catalogue entry in stackunderflow/services/meta_agent.TOOL_CATALOG — the JSON schema for its params.
Executor function in the same module, registered in the _EXECUTORS dispatcher table.
Test in tests/stackunderflow/routes/test_meta_agent_route.py that seeds the store, calls meta_agent.execute_tool(...), and asserts the result shape.

The dispatcher always returns a ToolResult — it never raises — so a buggy executor at worst produces an ok=False, data={"error": "..."} payload, which the model can see and react to.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Meta-agent sidebar

What it is

Privacy

Which Ollama models support tool-calling

Tool catalogue

Wire format

Frontend surfaces

Cookbook

Extending the catalogue

FilesExpand file tree

meta-agent.md

Latest commit

History

meta-agent.md

File metadata and controls

Meta-agent sidebar

What it is

Privacy

Which Ollama models support tool-calling

Tool catalogue

Wire format

Frontend surfaces

Cookbook

Extending the catalogue