FSM: union allow_always grants into action toolset (fixes #280) by odk- · Pull Request #302 · friday-platform/friday-studio

odk- · 2026-05-13T14:00:31Z

Summary

request_tool_access advertised itself as a permission gate but never affected what tools the LLM could actually call. buildTools (packages/fsm-engine/fsm-engine.ts) snapshots the toolset once per action and ignored ToolAccessGrants; the grant store was only consulted by request_tool_access itself to short-circuit duplicate elicitations. A user who allow_always'd workspace-mcp/create_task would see granted: true, then watch the next action's tool call fail because nothing in production code read that grant.

This PR closes the gap end-to-end across four commits.

What changes

1. `ToolAccessGrants.listForWorkspace` + storage server-awareness

New listForWorkspace method on the storage adapter + facade. KV prefix scan over keySegment(workspaceId).*, drains keys before fetching values (matching the jetstream-adapter pattern to dodge the nats.js v2.29 iterator-termination bug).
ToolAccessGrantSchema gains an optional serverId field. Additive; legacy grants keep deserializing.
grantAlways derives serverId from a qualified toolName when the caller doesn't pass one explicitly. KV key derivation is unchanged (still based on the full toolName), so hasGrant lookups by the original LLM-facing string keep resolving both legacy and new grants.
listForWorkspace returns ListedGrant[] with { toolName, bareToolName, serverId? }. bareToolName is what mcpResult.tools is keyed by; serverId is the source MCP server when known.

2. `buildTools` union + eager-load

Hoist the grant fetch above effectiveConfigs choice so the engine can react to grants before MCP servers are loaded.
Any grant-referenced workspace MCP server the workspace still declares is eagerly added to effectiveConfigs before createMCPTools. Closes the "strictly-qualified action + grant for a different server's tool" failure mode — a grant for gmail/send_email now works even when the action declares only google-calendar/list_events.
Stale grants (server dropped from workspace config since the grant was made) are silently skipped — no MCP load attempt, no confusing error.
After the per-server narrowing step, the union widens scoped with any granted tool the engine actually loaded. serverId-aware attribution: a grant with serverId: "gmail" only widens when mcpResult.toolsByServer.gmail actually contains the bare tool name. Defense in depth against cross-server bare-name collisions. Legacy bare-name grants fall through to the existing bareToolName-in-filtered check.
Bypass mode (dangerouslySkipAllowlist) skips the query — it already widens past per-agent narrowing, so the union would be redundant. Failure to read grants → warn-log + continue with the un-widened toolset; no crash, no escalation.

3. `request_tool_access` distinguishes `allow_once` vs `allow_always` for the LLM

persistent: boolean is now first-class on every answered/bypass/persistent_allow response shape — previously present only when true. The LLM branches on a single field instead of parsing answer.
grantType lands in the operator info-log (allow_once / allow_always / bypass / persistent_allow / deny) so audits don't need to join against the elicitation store.
Mirrored on the chat-side parallel (packages/system/agents/workspace-chat/tools/request-tool-access.ts).
Tool descriptions and SKILL.md teach the LLM to read persistent and behave accordingly.

4. SKILL.md rewrite

Drops the "same action continues" framing that the code path never supported.
Re-documents request_tool_access as an approval signal: allow_once requires routing around in-action; allow_always makes the tool available from the next action onward.
Adds a "Reading the response" subsection covering persistent + the operator grantType log field.
Worked example switched from fs_write_file (ambient platform tool, no narrowing applies) to a workspace MCP tool, which is where the narrowing actually applies.

5. `vitest.setup.ts`

Bootstraps + binds the tool-access-grant adapter so every suite that exercises buildTools end-to-end has the storage facade available.

Test plan

deno task typecheck — green
deno task test — 6495 / 6514 pass, no regressions; the 19 skipped are pre-existing
deno run -A npm:@biomejs/biome check — 0 errors
Storage tests cover empty bucket, workspace scoping, reserved-character round-trip, serverId derivation from qualified toolName, and explicit-serverId override.
grant-union.test.ts covers (10 cases):
- happy path: granted workspace-MCP tool unions in
- no grant → declared-only narrowing holds
- grant for an unloaded tool doesn't materialise (defense in depth)
- bypass mode skips the grant query entirely (spy assertion)
- grant-read failure → warn-and-continue with un-widened toolset
- action without a tools: array (hasNameAllowlist === false) → no-op union
- serverId match widens; serverId mismatch does not
- eager-load happy path: grant-referenced server joins the MCP load
- eager-load skipped when the grant-referenced server isn't in workspace config

Compat

Schema change is additive. Existing grants deserialize cleanly; read-path infers serverId from a qualified toolName when feasible.
hasGrant key derivation unchanged — back-compat with both legacy and new grants.
request_tool_access response now always carries persistent: boolean. Previously present only when true. Existing exact-match tests updated; no behavioural regression.

Scope notes

allow_once still can't widen the current action. Mid-stream toolset rebuild against the AI SDK is meaningfully larger work; out of scope. SKILL.md tells authors to delegate / failStep on allow_once.
Adjacent paths don't need this. Audited apps/atlasd/routes/agents/run.ts and packages/workspace/src/runtime.ts — neither has per-action tools: narrowing, so grants have nothing to widen there. The fsm-engine buildTools path is the sole narrowing surface.

Closes #280.

`request_tool_access` advertised itself as a permission gate, but the toolset for an LLM action was snapshotted by `buildTools` and never reconsulted ToolAccessGrants. Authors granting `allow_always` for a workspace MCP tool got `granted: true`, then watched the next action fail to call the tool — the grant existed in storage but no production code path read it. Fixes the gap for future actions: - New `ToolAccessGrants.listForWorkspace` enumerates persistent grants by workspace via a KV prefix scan. - `buildTools` queries it once per action and unions any granted tool already present in `filtered` (post-PLATFORM_TOOL_ALLOWLIST) into `scoped`. Bypass mode skips the query — it already widens past per-agent narrowing. - vitest.setup bootstraps + binds the tool-access-grant storage so every suite that builds tools through the FSM engine has the adapter available. The `allow_once` case still cannot widen the current action (mid-stream toolset rebuild is a separate workstream, tracked in the PR). SKILL.md is rewritten to stop claiming "the same action continues" and to point at delegation / failStep as the route-around path.

…t branches Adds three test cases to grant-union.test.ts that lock the contract: - Bypass mode skips the grant query entirely (spy assertion on listForWorkspace, mirroring the "redundant under bypass" claim from the originating PR). - Grant store read failure → warn-and-continue with un-widened toolset (no crash, no escalation). KV down / deserialize-edge-case path. - Action without a `tools:` array (hasNameAllowlist === false) → union is a no-op because every loaded tool already flowed through. Most common ad-hoc LLM action shape; previously untested. Refactors the mock for listForWorkspace from a closure-bound function into a vi.fn() so call-count assertions are possible. RunOptions grows two parameters (`bypass`, full `grants` envelope) without changing the existing happy-path test shapes.

… logs After #302 the two grant flavors have meaningfully different runtime semantics: `allow_always` widens future actions in the workspace via the grant union; `allow_once` is an approval signal only. The tool surface collapsed both into the same `granted: true` shape, leaving the LLM without a programmatic way to tell "this will keep working" from "this is a one-shot". - Always include `persistent: boolean` in answered / bypass / persistent responses. Previously present only when true; now first-class on every shape so the LLM branches on a single field instead of parsing `answer`. - Add `grantType` to the `info` log lines (`allow_once` / `allow_always` / `bypass` / `persistent_allow` / `deny`) so operators reading `~/.atlas/logs/global.log` can audit choices without joining against the elicitation store. - Mirror both changes in the chat-side parallel (`packages/system/agents/workspace-chat/tools/request-tool-access.ts`). - SKILL.md gains a "Reading the response" subsection that teaches the LLM to branch on `persistent` and explains the `grantType` log field. - Tool descriptions surface the `persistent` semantics in the prompt so the LLM sees the new behavior at registration time. Existing exact-match `toEqual` tests updated; no behavioural regression.

…P servers #302 left a known limitation: a grant for `gmail/send_email` only worked when the action that benefits from it happened to load gmail anyway — typically because it declared a bare tool name (forcing all workspace MCP servers to load). For strictly-qualified actions like `tools: [google-calendar/list_events]`, the gmail server stays unloaded and the grant union finds nothing to widen. This closes the gap end-to-end: - `ToolAccessGrantSchema` gains an optional `serverId` field. Additive schema change — legacy grants keep deserializing; the read path infers serverId from a qualified `toolName` when feasible. - `grantAlways` derives `serverId` from the LLM-facing tool name when the caller doesn't pass one explicitly. KV key derivation is unchanged (still based on `toolName`) so `hasGrant` lookups by the original LLM-facing string keep working for both old and new grants. - `listForWorkspace` returns `ListedGrant[]` with `{ toolName, bareToolName, serverId? }`. `bareToolName` is what `mcpResult.tools` is keyed by; `serverId` is the source MCP server when known. - `fsm-engine.buildTools` hoists the grant fetch above the `effectiveConfigs` choice. Any grant-referenced server that the workspace still declares is eagerly added to the MCP load. Stale grants (server dropped from workspace config) are silently skipped to avoid surfacing a confusing MCP error. - The union step now does serverId-aware attribution: a grant with `serverId: "gmail"` only widens when `mcpResult.toolsByServer.gmail` actually contains the bare tool name. Defense in depth against cross-server bare-name collisions. Legacy bare-name grants fall through to the existing bareToolName-in-filtered check. - `request_tool_access` (MCP-server side) passes the parsed serverId through to `grantAlways` so new grants land with the field set. Two new fsm-engine tests cover the eager-load happy path (grant-referenced server is loaded; the granted tool reaches the LLM) and the stale-grant defensive path (server not in workspace config → silently skipped). The earlier `serverId mismatch must not widen` test covers the cross-server attribution check.

…tics PR #302 puts the burden of "approval signal vs. permission grant" on SKILL.md and the tool description — language. Unit tests prove the backend wiring; they can't prove the LLM actually follows the framing. Two new first-principles scenarios cover the model behavior: - `allow-once-route-around-check` — action calls `request_tool_access` for `fake_inbox/modify_message_labels`, harness answers `allow_once`, scenario asserts the model emits `persistent: false` in the structured reply AND does not call `modify_message_labels` in the same action (route-around path). - `allow-always-grant-check` + `allow-always-future-action-check` — step 1 calls `request_tool_access`, harness answers `allow_always`, asserts `persistent: true` + no in-action call to the tool; step 2 triggers a separate action in the same workspace and asserts the LLM DOES call `modify_message_labels` directly (no second `request_tool_access`), proving the grant union widens future actions end-to-end through the live daemon. Modeled on the existing `blocking-elicitation-check` pattern in the same fixture; no new MCP server / harness scaffolding. Workspace.yml gains three signals + three jobs; scenario file gains two runners wired into the main dispatch list between the blocking-elicitation and request-user-decision scenarios.

basedfriday · 2026-05-13T15:53:38Z

@odk- could the mechanism actually be for request_tool_access be to reflect the user decision for always allow back into the workspace.yml? For the allow_once use case I get that we'll need an ephemeral grant, but seems like config is the right place to track permanent grants.

…lways-grants

#297's refactor in main changed materializeFixture to take fridayHome as the first argument so the harness can place fixtures under the daemon's own home. The new allow_once / allow_always scenarios from PR #302 were still calling the old 2-arg form; merging main surfaced two TS2554 errors at typecheck. Pass `d.fridayHome` through, matching every other call site in the file. Pure mechanical fix — no behavior change.

odk- · 2026-05-14T09:49:48Z

@odk- could the mechanism actually be for request_tool_access be to reflect the user decision for always allow back into the workspace.yml? For the allow_once use case I get that we'll need an ephemeral grant, but seems like config is the right place to track permanent grants.

I don't think so. If you allow it, then export the workspace and I import it does it mean I get it auto allowed? If it should ask on import then we can also have it without storing other ppl decisions in the workspace.yml.

odk- force-pushed the fix/280-union-allow-always-grants branch from a17931b to 6a4c62c Compare May 13, 2026 14:03

odk- added 3 commits May 13, 2026 16:25

odk- mentioned this pull request May 13, 2026

ToolAccessGrants: server-aware grants + missing tests + LLM-readable response #304

Merged

5 tasks

odk- marked this pull request as ready for review May 13, 2026 15:22

odk- requested review from Vpr99 and ljagiello as code owners May 13, 2026 15:22

odk- added 2 commits May 14, 2026 10:48

Merge remote-tracking branch 'origin/main' into fix/280-union-allow-a…

d3ef2f3

…lways-grants

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FSM: union allow_always grants into action toolset (fixes #280)#302

FSM: union allow_always grants into action toolset (fixes #280)#302
odk- wants to merge 7 commits into
mainfrom
fix/280-union-allow-always-grants

odk- commented May 13, 2026 •

edited

Loading

Uh oh!

basedfriday commented May 13, 2026

Uh oh!

odk- commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

odk- commented May 13, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

What changes

1. ToolAccessGrants.listForWorkspace + storage server-awareness

2. buildTools union + eager-load

3. request_tool_access distinguishes allow_once vs allow_always for the LLM

4. SKILL.md rewrite

5. vitest.setup.ts

Test plan

Compat

Scope notes

Uh oh!

basedfriday commented May 13, 2026

Uh oh!

odk- commented May 14, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

odk- commented May 13, 2026 •

edited

Loading

1. `ToolAccessGrants.listForWorkspace` + storage server-awareness

2. `buildTools` union + eager-load

3. `request_tool_access` distinguishes `allow_once` vs `allow_always` for the LLM

5. `vitest.setup.ts`