Add tool approval support with approveToolCall/denyToolCall by sethconvex · Pull Request #222 · get-convex/agent

sethconvex · 2026-02-11T00:29:04Z

Summary

Implements human-in-the-loop tool approval using AI SDK v6's native collectToolApprovals(). When a tool has needsApproval: true, generation pauses with a tool-approval-request in the thread. The client saves an approval/denial response, then calls streamText to continue — the SDK handles tool execution and denial results automatically.

Core changes (~55 lines of new implementation):

Agent.approveToolCall() / Agent.denyToolCall() — save tool-approval-response messages
serializeNewMessagesInStep — accept newResponseMessages param for approval tool-results
previousResponseMessageCount tracking in save closure for cumulative step.response.messages

Bug fixes included:

Fix addMessages returning stale data after ctx.db.replace
Fix streamText not cleaning up DeltaStreamer on onStepFinish errors
Fix useDeltaStreams not clearing state when streams finish (caused ghost streaming bubble)

Test plan

All 218 existing tests pass (npm test)
Unit tests for approve/deny flows (src/client/approval.test.ts)

🤖 Generated with Claude Code

Summary by CodeRabbit

Release Notes

New Features
- Added tool approval workflow: users can now approve or deny tool execution requests with optional reasoning.
Bug Fixes
- Improved error handling for streaming operations to ensure proper cleanup on failures.
Tests
- Added comprehensive test suite for tool approval flows.
- Added Gemini compatibility validation tests.

coderabbitai · 2026-02-11T00:29:15Z

Warning

Rate limit exceeded

@sethconvex has exceeded the limit for the number of commits that can be reviewed per hour. Please wait 12 minutes and 6 seconds before requesting another review.

⌛ How to resolve this issue?

After the wait time has elapsed, a review can be triggered using the @coderabbitai review command as a PR comment. Alternatively, push new commits to this PR.

We recommend that you space out your commits to avoid hitting the rate limit.

🚦 How do rate limits work?

CodeRabbit enforces hourly rate limits for each developer per organization.

Our paid plans have higher rate limits than the trial, open-source and free plans. In all cases, we re-allow further reviews after a brief timeout.

Please see our FAQ for further information.

✨ Finishing Touches

🧪 Generate unit tests (beta)

Create PR with unit tests
Post copyable unit tests in a comment
Commit unit tests in branch elegant-tool-approval

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

sethconvex · 2026-02-11T00:29:20Z

Add tool approval example: agent, backend, UI, and E2E tests #226
Add tool approval support with approveToolCall/denyToolCall #222 👈 (View in Graphite)
Add compile-time errors for AI SDK v5 patterns and docs #225
Fix race condition: Atomic stream finish with message save #224
Fix example tools for AI SDK v6 and remove noisy warning #223
AI SDK v6 Support #216 : 1 other dependent PR (#217 )
main

This stack of pull requests is managed by Graphite. Learn more about stacking.

pkg-pr-new · 2026-02-11T00:29:40Z

Open in StackBlitz

npm i https://pkg.pr.new/get-convex/agent/@convex-dev/agent@222

commit: ca514fe

kerns · 2026-02-11T04:59:23Z

Possible bug in the addMessages mutation?

The finishStreamId arg (added for issue #181) isn't extracted in the destructuring on line 164 of messages.ts. It falls into ...rest, which gets spread into the message document via { ...rest, ...message }. The messages table schema doesn't have a finishStreamId field, so Convex rejects the insert. The fix is a one-liner — add finishStreamId, to the destructuring alongside promptMessageId, pendingMessageId, and hideFromUserIdSearch.

I patched this locally and it resolves the issue. This only triggers when the stream finish is batched with the message save (the atomic finish path from #181), so it doesn't surface until you hit the approval → continuation flow.

sethconvex · 2026-02-11T05:23:48Z

@kerns Good catch! This was indeed a bug — finishStreamId was leaking into messageDoc via ...rest. It's already been fixed in #224 (the downstack PR that cherry-picked the atomic stream finish fix). The destructuring now extracts finishStreamId before the rest spread.

coderabbitai

Actionable comments posted: 1

🤖 Fix all issues with AI agents

In `@src/client/approval.test.ts`:
- Around line 34-55: The function getApprovalIdFromSavedMessages currently types
its parameter too narrowly; change its parameter to accept the actual
MessageDoc[] shape (e.g., savedMessages: Array<MessageDoc> | undefined) and
update the accessors to safely handle optional message and content fields (use
savedMessage.message?.content and guard that content is an array before
flatMap). Keep the same logic for finding a part with type ===
"tool-approval-request" and the same runtime check that
approvalRequest.approvalId is a string before returning it, throwing the
existing error otherwise.

🧹 Nitpick comments (2)

src/client/approval.test.ts (2)
105-147: Consider moving test actions to a separate non-test file.

The static analysis tool (Biome) flags exports from test files as suspicious. The testApproveFlow and testDenyFlow actions are exported for use with initConvexTest, which is a valid pattern, but it may trigger lint warnings.

If this pattern is intentional and standard in your codebase, consider adding a Biome exception for this file. Otherwise, move the action definitions to a separate module (e.g., approval.test.helpers.ts) and import them in the test.

29-32: Module-level mutable state may cause test interference.

The usageCalls array is shared across all tests and manually cleared at the start of each test (usageCalls.length = 0). This works but is fragile if tests run in parallel or if a test forgets to clear the array.

Consider using a test-local variable or Vitest's beforeEach hook for more robust isolation.
♻️ Alternative: Use beforeEach for cleanup
+import { describe, expect, test, beforeEach } from "vitest";
-import { describe, expect, test } from "vitest";

 // Track usage handler calls to verify the full flow is exercised
 const usageCalls: LanguageModelUsage[] = [];

+beforeEach(() => {
+  usageCalls.length = 0;
+});
+
 // ... in tests, remove manual clearing:
-  test("approve: ...", async () => {
-    usageCalls.length = 0;
+  test("approve: ...", async () => {

src/client/approval.test.ts

kerns · 2026-02-11T05:41:01Z

Awesome. All credit to Opus on that one.😇

kerns · 2026-02-11T10:31:32Z

@sethconvex On #222 w/ approveToolCall() / denyToolCall() and the finishStreamId atomic stream finish fix 💥

CleanShot.2026-02-11.at.10.22.06-converted.mp4

Those animated tool call badges you see in the start (Checking memories...Deleting collection) – those don't come back after the first approval for "Creating links..." etc. Probably an issue on my side, but ....progress! Thanks again for your work on this.

ianmacartney

Really nice! If this is working for tool calls, works for me. only nit is to actually test that the message roles get saved / merged as expected

src/client/approval.test.ts

sethconvex · 2026-02-20T06:30:29Z

Addressed review feedback: replaced imprecise toBeGreaterThanOrEqual assertions with exact toEqual checks on threadMessageRoles, verifying the stored message order is [user, assistant, tool, tool, assistant]. Also manually verified with Gemini 2.5 Flash that both clean and consecutive-tool-message sequences are accepted without errors.

coderabbitai

Actionable comments posted: 2

🧹 Nitpick comments (1)

src/client/approval.test.ts (1)
28-32: Shared mutable state could cause flaky tests under parallel execution.

usageCalls is module-level shared state. While tests reset it with usageCalls.length = 0, if vitest runs tests in parallel within this file, usage data from one test could leak into another's assertions.

Consider isolating state per test or ensuring sequential execution:
♻️ Option 1: Return usage data from the action instead of module-level state
-// Track usage handler calls to verify the full flow is exercised
-const usageCalls: LanguageModelUsage[] = [];
-const testUsageHandler: UsageHandler = async (_ctx, args) => {
-  usageCalls.push(args.usage);
-};
+// Create a fresh usage tracker per agent to avoid shared state
+function createUsageTracker() {
+  const calls: LanguageModelUsage[] = [];
+  const handler: UsageHandler = async (_ctx, args) => {
+    calls.push(args.usage);
+  };
+  return { calls, handler };
+}
Then create trackers per agent and return usage data from the action.
♻️ Option 2: Use vitest's sequential test mode for this file

Add at the top of the describe block:
describe.sequential("Tool Approval Workflow", () => {
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@src/client/approval.test.ts` around lines 28 - 32, The test uses a
module-level mutable array usageCalls and a shared testUsageHandler which can
leak between parallel tests; to fix, stop using module-shared state by creating
per-test trackers (e.g., declare const usageCalls: LanguageModelUsage[] and
const testUsageHandler: UsageHandler inside each it/test or beforeEach so each
test gets a fresh array), or change the action to return usage data directly
from the function under test and assert on that returned value instead of
reading usageCalls, or alternatively enforce sequential execution for the suite
by replacing describe with describe.sequential; update references to usageCalls
and testUsageHandler accordingly.

🤖 Prompt for all review comments with AI agents

Verify each finding against the current code and only fix it if needed.

Inline comments:
In `@scripts/test-gemini-approval.ts`:
- Around line 113-115: Replace the catch parameter type from any to unknown and
narrow it before using .message: change the catch clause in
scripts/test-gemini-approval.ts to catch (e: unknown) and inside the block check
if e is an instance of Error (e.g., if (e instanceof Error) {
console.error("FAILED -", e.message.slice(0,500)) } else { console.error("FAILED
-", String(e).slice(0,500)) }) so you safely handle thrown non-Error values and
satisfy TypeScript strict mode.

In `@src/client/streamText.ts`:
- Around line 163-172: When catching errors from awaiting stream and calling
result.consumeStream() inside the try/catch, call the pending RPC finalizer
before notifying the streamer: add an awaited call to call.fail() (or await
call.fail(e instanceof Error ? e.message : String(e))) immediately before
awaiting streamer?.fail(...) so the pending message is finalized the same way as
the onError path; keep the existing rethrow of the original error after both
awaits.

---

Nitpick comments:
In `@src/client/approval.test.ts`:
- Around line 28-32: The test uses a module-level mutable array usageCalls and a
shared testUsageHandler which can leak between parallel tests; to fix, stop
using module-shared state by creating per-test trackers (e.g., declare const
usageCalls: LanguageModelUsage[] and const testUsageHandler: UsageHandler inside
each it/test or beforeEach so each test gets a fresh array), or change the
action to return usage data directly from the function under test and assert on
that returned value instead of reading usageCalls, or alternatively enforce
sequential execution for the suite by replacing describe with
describe.sequential; update references to usageCalls and testUsageHandler
accordingly.

coderabbitai · 2026-02-20T07:21:22Z

scripts/test-gemini-approval.ts

+  } catch (e: any) {
+    console.error("FAILED -", e.message?.slice(0, 500) ?? e);
+  }


⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # Check the current state of the file at lines 113-115 if [ -f "scripts/test-gemini-approval.ts" ]; then echo "=== File content at lines 113-115 ===" sed -n '110,120p' "scripts/test-gemini-approval.ts" else echo "File not found" fi

Repository: get-convex/agent

Length of output: 392

🏁 Script executed:

#!/bin/bash # Search for all catch (e: any) patterns in TypeScript files echo "=== Search for catch (e: any) patterns ===" rg "catch\s*\(\s*e\s*:\s*any\s*\)" --type ts

Repository: get-convex/agent

Length of output: 155

🏁 Script executed:

#!/bin/bash # Check TypeScript and ESLint configuration echo "=== TypeScript config ===" if [ -f "tsconfig.json" ]; then cat tsconfig.json | head -50 fi echo "" echo "=== ESLint config ===" if [ -f ".eslintrc.json" ] || [ -f ".eslintrc.js" ] || [ -f ".eslintrc.cjs" ]; then ls -la .eslintrc* 2>/dev/null && cat .eslintrc* | head -50 else # Check if eslint config is in package.json if [ -f "package.json" ]; then grep -A 20 '"eslintConfig"' package.json || echo "No eslintConfig in package.json" fi fi

Repository: get-convex/agent

Length of output: 868

Replace any in catch with unknown and narrow.

The code at lines 113-115 uses catch (e: any), which violates TypeScript's strict mode. Use unknown and narrow to Error for safer handling that accounts for JavaScript's ability to throw non-Error values.

🔧 Suggested fix

- } catch (e: any) { - console.error("FAILED -", e.message?.slice(0, 500) ?? e); + } catch (e: unknown) { + const message = + e instanceof Error ? e.message?.slice(0, 500) : String(e); + console.error("FAILED -", message); }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

} catch (e: any) {

console.error("FAILED -", e.message?.slice(0, 500) ?? e);

}

} catch (e: unknown) {

const message =

e instanceof Error ? e.message?.slice(0, 500) : String(e);

console.error("FAILED -", message);

}

🧰 Tools

🪛 GitHub Check: Test and lint

[failure] 113-113:
Unexpected any. Specify a different type

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@scripts/test-gemini-approval.ts` around lines 113 - 115, Replace the catch parameter type from any to unknown and narrow it before using .message: change the catch clause in scripts/test-gemini-approval.ts to catch (e: unknown) and inside the block check if e is an instance of Error (e.g., if (e instanceof Error) { console.error("FAILED -", e.message.slice(0,500)) } else { console.error("FAILED -", String(e).slice(0,500)) }) so you safely handle thrown non-Error values and satisfy TypeScript strict mode.

coderabbitai · 2026-02-20T07:21:22Z

src/client/streamText.ts

+    try {
+      await stream;
+      await result.consumeStream();
+    } catch (e) {
+      // If the stream errored (e.g. onStepFinish threw), the DeltaStreamer's
+      // finish() was never called, leaving the streaming message stuck in
+      // "streaming" state. Clean it up by marking it as aborted.
+      await streamer?.fail(e instanceof Error ? e.message : String(e));
+      throw e;
+    }


⚠️ Potential issue | 🟠 Major

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash # First, verify the file exists and examine the context around lines 163-172 if [ -f "src/client/streamText.ts" ]; then echo "=== File exists, examining context around lines 163-172 ===" sed -n '140,190p' src/client/streamText.ts | cat -n else echo "File not found" fi

Repository: get-convex/agent

Length of output: 2160

🏁 Script executed:

#!/bin/bash # Search for the streamText function definition to understand the full context rg -n "export.*streamText|async.*streamText" src/client/streamText.ts -A 5

Repository: get-convex/agent

Length of output: 234

🏁 Script executed:

#!/bin/bash # Look for where 'call' is defined in this file rg -n "call\s*=" src/client/streamText.ts | head -20

Repository: get-convex/agent

Length of output: 42

🏁 Script executed:

#!/bin/bash # Find all call sites of streamText rg -n "streamText\(" src -g'*.ts' -B 2 -A 5

Repository: get-convex/agent

Length of output: 2237

🏁 Script executed:

#!/bin/bash # Get the full streamText function signature and beginning sed -n '34,80p' src/client/streamText.ts | cat -n

Repository: get-convex/agent

Length of output: 2102

🏁 Script executed:

#!/bin/bash # Search for call.fail usage to understand what it does rg -n "call\.fail" src -g'*.ts' -B 3 -A 3

Repository: get-convex/agent

Length of output: 1303

🏁 Script executed:

#!/bin/bash # Look for where 'call' parameter is defined and what type it is rg -n "call:" src/client/streamText.ts -A 2 -B 2

Repository: get-convex/agent

Length of output: 42

🏁 Script executed:

#!/bin/bash # Search for pending message handling and call.save usage rg -n "call\.save|pending" src/client/streamText.ts -B 2 -A 2

Repository: get-convex/agent

Length of output: 878

🏁 Script executed:

#!/bin/bash # Check error handling at streamText call sites rg -n "streamText\(" src -g'*.ts' -A 10 | grep -E "(streamText|catch|try|\.then|error)" | head -40

Repository: get-convex/agent

Length of output: 350

🏁 Script executed:

#!/bin/bash # Find startGeneration to understand what call object contains rg -n "export.*startGeneration|function startGeneration" src -g'*.ts' -A 20

Repository: get-convex/agent

Length of output: 2346

🏁 Script executed:

#!/bin/bash # Look for the full context of onError handler to see the pattern sed -n '115,130p' src/client/streamText.ts | cat -n

Repository: get-convex/agent

Length of output: 793

🏁 Script executed:

#!/bin/bash # Search for type definitions of the call object rg -n "call\." src/client/streamText.ts | head -20

Repository: get-convex/agent

Length of output: 376

🏁 Script executed:

#!/bin/bash # Get the full startGeneration function to see what it returns sed -n '30,120p' src/client/start.ts | cat -n

Repository: get-convex/agent

Length of output: 3665

🏁 Script executed:

#!/bin/bash # Look for the return type or return statement of startGeneration rg -n "return.*{" src/client/start.ts | head -5

Repository: get-convex/agent

Length of output: 141

🏁 Script executed:

#!/bin/bash # Check if there's documentation about what fail() does rg -n "\.fail.*=" src/client/start.ts -B 5 -A 5

Repository: get-convex/agent

Length of output: 42

🏁 Script executed:

#!/bin/bash # Look for the full startGeneration implementation - it may be long wc -l src/client/start.ts

Repository: get-convex/agent

Length of output: 83

🏁 Script executed:

#!/bin/bash # Find the fail function implementation in startGeneration sed -n '90,200p' src/client/start.ts | cat -n

Repository: get-convex/agent

Length of output: 4054

🏁 Script executed:

#!/bin/bash # Search for where fail is defined/assigned rg -n "fail\s*=" src/client/start.ts -B 2 -A 10

Repository: get-convex/agent

Length of output: 506

🏁 Script executed:

#!/bin/bash # Look at the full error handling flow in streamText - show more context around the try/catch sed -n '155,185p' src/client/streamText.ts | cat -n

Repository: get-convex/agent

Length of output: 1353

🏁 Script executed:

#!/bin/bash # Look at how other functions handle errors with call.fail rg -n "call\.fail" src -g'*.ts' -B 5 -A 2 | head -50

Repository: get-convex/agent

Length of output: 1558

🏁 Script executed:

#!/bin/bash # Check if there are any other try/catch blocks in streamText.ts that might show the pattern sed -n '115,135p' src/client/streamText.ts | cat -n

Repository: get-convex/agent

Length of output: 1059

🏁 Script executed:

#!/bin/bash # Check if pendingMessage is created and what state it starts in rg -n "pendingMessage" src/client/start.ts -B 2 -A 2 | head -40

Repository: get-convex/agent

Length of output: 1466

🏁 Script executed:

#!/bin/bash # Check the saveInputMessages function to understand pending message creation rg -n "export.*saveInputMessages|function saveInputMessages" src -g'*.ts' -A 10

Repository: get-convex/agent

Length of output: 682

🏁 Script executed:

#!/bin/bash # Look at the full onError handler pattern to understand the expected behavior sed -n '115,130p' src/client/streamText.ts

Repository: get-convex/agent

Length of output: 681

🏁 Script executed:

#!/bin/bash # Let's verify what pendingMessage state means by checking the message finalization rg -n "finalizeMessage" src -g'*.ts' -B 2 -A 5 | head -40

Repository: get-convex/agent

Length of output: 2099

🏁 Script executed:

#!/bin/bash # Check if streaming creates its own pending messages or uses the same one rg -n "createPendingMessage" src/client/streamText.ts -B 3 -A 3

Repository: get-convex/agent

Length of output: 658

🏁 Script executed:

#!/bin/bash # Check if there's any other cleanup that might handle this case rg -n "onStepFinish|pendingFinalStep" src/client/streamText.ts -B 2 -A 5

Repository: get-convex/agent

Length of output: 1933

🏁 Script executed:

#!/bin/bash # Verify the exact pattern in onError vs the try/catch to confirm the discrepancy sed -n '121,125p' src/client/streamText.ts && echo "---" && sed -n '163,172p' src/client/streamText.ts

Repository: get-convex/agent

Length of output: 683

🏁 Script executed:

#!/bin/bash # Confirm whether pendingMessage would exist when saveStreamDeltas is enabled # Check the conditions under which streamText creates a pending message sed -n '136,156p' src/client/start.ts

Repository: get-convex/agent

Length of output: 839

🏁 Script executed:

#!/bin/bash # Check what saveStreamDeltas options require to understand if pending message is always created rg -n "saveStreamDeltas" src/client/streamText.ts -B 2 -A 2 | head -30

Repository: get-convex/agent

Length of output: 997

🏁 Script executed:

#!/bin/bash # Final verification: check if call.fail is passed to DeltaStreamer's onAsyncAbort sed -n '86,100p' src/client/streamText.ts

Repository: get-convex/agent

Length of output: 512

🏁 Script executed:

#!/bin/bash # Verify what streamer?.fail() does - does it indirectly call call.fail? rg -n "\.fail.*=" src/client/streaming.ts | head -20

Repository: get-convex/agent

Length of output: 42

🏁 Script executed:

#!/bin/bash # Search for DeltaStreamer.fail implementation to confirm it doesn't call call.fail rg -n "class DeltaStreamer" src -g'*.ts' -A 150 | grep -E "(fail|constructor|onAsyncAbort)" | head -20

Repository: get-convex/agent

Length of output: 484

🏁 Script executed:

#!/bin/bash # Double-check: in what scenarios is saveStreamDeltas AND threadId both enabled? rg -n "saveStreamDeltas.*true|saveStreamDeltas.*{" src -g'*.ts' -B 5 -A 2 | head -30

Repository: get-convex/agent

Length of output: 207

🏁 Script executed:

#!/bin/bash # Find the fail method in DeltaStreamer rg -n "\.fail\s*\(" src/client/streaming.ts -B 2 -A 10

Repository: get-convex/agent

Length of output: 42

🏁 Script executed:

#!/bin/bash # Search for the fail method definition in DeltaStreamer class sed -n '200,400p' src/client/streaming.ts | grep -A 10 "fail"

Repository: get-convex/agent

Length of output: 368

Add call.fail() to finalize pending messages on stream-consumption errors.

The onError handler correctly calls call.fail() before streamer?.fail() to finalize the pending message. However, the try/catch block around stream consumption (lines 163–172) only calls streamer?.fail(), leaving the pending message un-finalized if result.consumeStream() throws.

Suggested fix

} catch (e) { // If the stream errored (e.g. onStepFinish threw), the DeltaStreamer's // finish() was never called, leaving the streaming message stuck in // "streaming" state. Clean it up by marking it as aborted. + const reason = e instanceof Error ? e.message : String(e); + await call.fail(reason); await streamer?.fail(e instanceof Error ? e.message : String(e)); throw e; }

📝 Committable suggestion

‼️ IMPORTANT
Carefully review the code before committing. Ensure that it accurately replaces the highlighted code, contains no missing lines, and has no issues with indentation. Thoroughly test & benchmark the code to ensure it meets the requirements.

Suggested change

try {

await stream;

await result.consumeStream();

} catch (e) {

// If the stream errored (e.g. onStepFinish threw), the DeltaStreamer's

// finish() was never called, leaving the streaming message stuck in

// "streaming" state. Clean it up by marking it as aborted.

await streamer?.fail(e instanceof Error ? e.message : String(e));

throw e;

}

try {

await stream;

await result.consumeStream();

} catch (e) {

// If the stream errored (e.g. onStepFinish threw), the DeltaStreamer's

// finish() was never called, leaving the streaming message stuck in

// "streaming" state. Clean it up by marking it as aborted.

const reason = e instanceof Error ? e.message : String(e);

await call.fail(reason);

await streamer?.fail(e instanceof Error ? e.message : String(e));

throw e;

}

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed. In `@src/client/streamText.ts` around lines 163 - 172, When catching errors from awaiting stream and calling result.consumeStream() inside the try/catch, call the pending RPC finalizer before notifying the streamer: add an awaited call to call.fail() (or await call.fail(e instanceof Error ? e.message : String(e))) immediately before awaiting streamer?.fail(...) so the pending message is finalized the same way as the onError path; keep the existing rethrow of the original error after both awaits.

sethconvex · 2026-02-20T07:25:26Z

Merge activity

Feb 20, 7:25 AM UTC: A user started a stack merge that includes this pull request via Graphite.
Feb 20, 7:30 AM UTC: Graphite rebased this pull request as part of a merge.
Feb 20, 7:31 AM UTC: @sethconvex merged this pull request with Graphite.

Implements human-in-the-loop approval for tool calls using AI SDK v6's native collectToolApprovals(). When a tool has needsApproval, generation pauses with a tool-approval-request. After the user decides, calling streamText with the response message continues generation automatically. Core changes: - Agent.approveToolCall/denyToolCall save tool-approval-response messages - serializeNewMessagesInStep accepts newResponseMessages for approval flows - previousResponseMessageCount tracking for cumulative step.response.messages - Fix addMessages returning stale data after ctx.db.replace - Fix streamText not cleaning up DeltaStreamer on onStepFinish errors - Fix useDeltaStreams not clearing state when streams finish - Add realistic DEFAULT_USAGE in mockModel for AI SDK v6 - Unit tests for approve/deny flows Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

Apply suggested changes

## Summary Example implementation demonstrating the tool approval flow from #222: - **Agent** (`example/convex/agents/approval.ts`): `delete_file` tool with `needsApproval: true` - **Backend** (`example/convex/chat/approval.ts`): `sendMessage`, `submitApproval`, `handleApprovalDecision` - **React UI** (`example/ui/chat/ChatApproval.tsx`): Approve/Deny buttons with denial reason input, type-safe approval helpers - **E2E tests** (`example/convex/approval.test.ts`): Approve and deny flows through usageHandler ## Test plan - [x] E2E approval test: approve flow executes tool and continues generation - [x] E2E approval test: deny flow produces denial acknowledgment from model - [x] Manual test: Send → Approve → clean completion (no stuck streaming) - [x] Manual test: Send → Deny → model acknowledges denial 🤖 Generated with [Claude Code](https://claude.com/claude-code)

sethconvex changed the base branch from main to graphite-base/222 February 11, 2026 00:36

sethconvex force-pushed the graphite-base/222 branch from 5dfefff to 231f81f Compare February 11, 2026 00:36

sethconvex changed the base branch from graphite-base/222 to rc/ai-sdk-v6 February 11, 2026 00:36

This was referenced Feb 11, 2026

AI SDK v6 Support #216

Merged

Add auto-continuation for tool approval workflow #217

Open

sethconvex changed the title ~~Upgrade core library to AI SDK 6.0~~ Add tool approval support with approveToolCall/denyToolCall Feb 11, 2026

sethconvex changed the base branch from rc/ai-sdk-v6 to graphite-base/222 February 11, 2026 00:49

sethconvex force-pushed the elegant-tool-approval branch from c4f452b to e27ca09 Compare February 11, 2026 00:49

sethconvex changed the base branch from graphite-base/222 to v5-compat-errors-and-docs February 11, 2026 00:49

This was referenced Feb 11, 2026

Fix example tools for AI SDK v6 and remove noisy warning #223

Merged

Fix race condition: Atomic stream finish with message save #224

Merged

Add compile-time errors for AI SDK v5 patterns and docs #225

Merged

sethconvex force-pushed the elegant-tool-approval branch from e27ca09 to 6915b41 Compare February 11, 2026 01:14

sethconvex force-pushed the v5-compat-errors-and-docs branch from 5283806 to f5a0659 Compare February 11, 2026 01:14

sethconvex marked this pull request as ready for review February 11, 2026 01:14

sethconvex marked this pull request as draft February 11, 2026 01:15

sethconvex force-pushed the elegant-tool-approval branch 2 times, most recently from 79fab8a to c70a594 Compare February 11, 2026 05:21

sethconvex mentioned this pull request Feb 11, 2026

Add tool approval example: agent, backend, UI, and E2E tests #226

Merged

4 tasks

coderabbitai bot reviewed Feb 11, 2026

View reviewed changes

src/client/approval.test.ts Show resolved Hide resolved

sethconvex force-pushed the elegant-tool-approval branch from c70a594 to ef701ff Compare February 11, 2026 05:31

sethconvex marked this pull request as ready for review February 17, 2026 22:31

ianmacartney approved these changes Feb 20, 2026

View reviewed changes

src/client/approval.test.ts Outdated Show resolved Hide resolved

sethconvex force-pushed the elegant-tool-approval branch from 022a5b8 to 52b4b56 Compare February 20, 2026 06:35

sethconvex force-pushed the v5-compat-errors-and-docs branch from f5a0659 to 251d0c7 Compare February 20, 2026 06:35

sethconvex force-pushed the elegant-tool-approval branch from 52b4b56 to ffb13f8 Compare February 20, 2026 06:51

sethconvex force-pushed the v5-compat-errors-and-docs branch 2 times, most recently from bdbdde1 to 269bb8b Compare February 20, 2026 06:59

sethconvex force-pushed the elegant-tool-approval branch from ffb13f8 to 3953862 Compare February 20, 2026 06:59

sethconvex force-pushed the v5-compat-errors-and-docs branch from 269bb8b to 93fec7b Compare February 20, 2026 07:12

sethconvex force-pushed the elegant-tool-approval branch from 3953862 to 4e8cd39 Compare February 20, 2026 07:12

coderabbitai bot reviewed Feb 20, 2026

View reviewed changes

sethconvex force-pushed the v5-compat-errors-and-docs branch from 93fec7b to f2b2d4a Compare February 20, 2026 07:22

sethconvex force-pushed the elegant-tool-approval branch from 4e8cd39 to 2bfbdd6 Compare February 20, 2026 07:23

sethconvex changed the base branch from v5-compat-errors-and-docs to graphite-base/222 February 20, 2026 07:28

sethconvex changed the base branch from graphite-base/222 to main February 20, 2026 07:29

sethconvex and others added 6 commits February 20, 2026 07:30

Apply suggested changes

752ff66

Apply suggested changes

[checkpoint] Auto-save at 10:20:48 PM

f7b71c2

[checkpoint] Auto-save at 10:25:48 PM

3954f2b

[checkpoint] Auto-save at 10:30:48 PM

b5b582e

Remove leftover Gemini test script

4e7eed3

sethconvex force-pushed the elegant-tool-approval branch from ca514fe to 4e7eed3 Compare February 20, 2026 07:30

sethconvex merged commit 5110045 into main Feb 20, 2026
3 checks passed

-  } catch (e: any) {
-    console.error("FAILED -", e.message?.slice(0, 500) ?? e);
-  }
+  } catch (e: unknown) {
+    const message =
+      e instanceof Error ? e.message?.slice(0, 500) : String(e);
+    console.error("FAILED -", message);
+  }

Comments

Conversation

sethconvex commented Feb 11, 2026 • edited by coderabbitai bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Test plan

Summary by CodeRabbit

Release Notes

Uh oh!

coderabbitai bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Rate limit exceeded

Uh oh!

sethconvex commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pkg-pr-new bot commented Feb 11, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

kerns commented Feb 11, 2026

Uh oh!

sethconvex commented Feb 11, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

kerns commented Feb 11, 2026

Uh oh!

kerns commented Feb 11, 2026

Uh oh!

ianmacartney left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

sethconvex commented Feb 20, 2026

Uh oh!

coderabbitai bot left a comment

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

coderabbitai bot Feb 20, 2026

Choose a reason for hiding this comment

Uh oh!

sethconvex commented Feb 20, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Merge activity

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

sethconvex commented Feb 11, 2026 •

edited by coderabbitai bot

Loading

coderabbitai bot commented Feb 11, 2026 •

edited

Loading

sethconvex commented Feb 11, 2026 •

edited

Loading

pkg-pr-new bot commented Feb 11, 2026 •

edited

Loading

sethconvex commented Feb 20, 2026 •

edited

Loading