
perf: parallelize EmbedBatch using goroutines and errgroup #107

Merged
Kavirubc merged 4 commits into main from claude/issue-51-20260311-0959 on Mar 16, 2026
Conversation


@Kavirubc Kavirubc commented Mar 11, 2026

Refactor EmbedBatch in internal/integrations/ai/embedder.go to process embedding requests concurrently instead of serially.

A semaphore channel caps in-flight goroutines at maxBatchConcurrency (10) to avoid overwhelming the API. Results are written to pre-indexed slice positions so no mutex is required. Context cancellation propagates via errgroup.WithContext.

Also adds four unit tests covering empty input, order preservation, error propagation, and the concurrency cap.

Closes #51

Generated with Claude Code

Summary by CodeRabbit

  • New Features

    • Batch embedding now runs in parallel with built-in concurrency controls; results preserve input order.
  • Tests

    • Added comprehensive tests for batch embedding covering empty input, order preservation, error propagation, and concurrency limits.
  • Chores

    • Adjusted dependency management.

@gh-simili-bot
Contributor

Simili Triage Report

Note

Quality Score: 9.5/10 (Excellent)
The issue could be improved. See suggestions below.

Classification

Category Value
Labels
Quality Improvements
  • Missing quantitative performance targets
  • No explicit discussion of potential resource implications
  • Add expected performance improvements (e.g., target speedup, latency reduction)
  • Briefly discuss potential resource implications (e.g., CPU, memory usage)
Similar Threads
  • 83% 📝 #51 [Performance]: Parallelize Batch Embedding Gene...
  • 79% 📝 #51 [Performance]: Parallelize Batch Embedding Gene... (Open)
  • 77% 🔀 #99 feat(ai): OpenAI retry parity — typed error + t... (Open)

Warning

Possible Duplicate (Confidence: 98%)
This pull request might be a duplicate of #51.
Reason: The current issue explicitly states 'Closes #51' and its description perfectly matches the problem statement and proposed solution in Issue #51. Both issues describe the exact same performance problem with EmbedBatch (serial processing) and propose the same solution of parallelizing it using goroutines and errgroup. Fixing the current issue fully resolves #51.
Also related to: #99 (feat(ai): OpenAI retry parity — typed error + tests (closes #69))

This pull request will be automatically closed in 72 hours if no objections are raised. If you believe this is not a duplicate, please leave a comment explaining why.


Generated by Simili Bot

Copilot AI left a comment


Pull request overview

This PR refactors EmbedBatch to generate embeddings concurrently (bounded by a fixed concurrency limit) to speed up bulk indexing, and adds unit tests intended to validate the new behavior.

Changes:

  • Parallelize EmbedBatch using errgroup.WithContext plus a semaphore-style concurrency cap.
  • Add unit tests for empty input, order preservation, error propagation, and concurrency limiting.
  • Add golang.org/x/sync as a direct dependency.

Reviewed changes

Copilot reviewed 3 out of 3 changed files in this pull request and generated 5 comments.

  • internal/integrations/ai/embedder.go: Implements concurrent EmbedBatch using errgroup and a semaphore-limited worker pattern.
  • internal/integrations/ai/embedder_test.go: Adds tests for EmbedBatch behavior (empty input, ordering, errors, concurrency).
  • go.mod: Adds golang.org/x/sync dependency for errgroup.


Comment on lines +175 to +212
func TestEmbedBatch_ConcurrencyLimit(t *testing.T) {
	// Verify that EmbedBatch honours maxBatchConcurrency: concurrent in-flight
	// requests should never exceed the limit.
	var (
		inFlight    atomic.Int32
		maxObserved atomic.Int32
	)
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		cur := inFlight.Add(1)
		defer inFlight.Add(-1)
		// Update max observed — simple CAS loop.
		for {
			old := maxObserved.Load()
			if cur <= old || maxObserved.CompareAndSwap(old, cur) {
				break
			}
		}
		w.Header().Set("Content-Type", "application/json")
		w.WriteHeader(200)
		_, _ = w.Write(embeddingOKBody())
	}))
	defer srv.Close()

	e := newTestEmbedder(srv.URL)
	e.retryConfig = fastRetry

	texts := make([]string, maxBatchConcurrency*3)
	for i := range texts {
		texts[i] = fmt.Sprintf("text-%d", i)
	}

	if _, err := e.EmbedBatch(context.Background(), texts); err != nil {
		t.Fatalf("unexpected error: %v", err)
	}
	if got := maxObserved.Load(); got > int32(maxBatchConcurrency) {
		t.Errorf("max concurrent requests = %d, want <= %d", got, maxBatchConcurrency)
	}
}

Copilot AI Mar 11, 2026


TestEmbedBatch_ConcurrencyLimit may be ineffective/flaky because the handler responds immediately, so requests may not overlap enough to observe concurrency >1 (and the test only asserts maxObserved <= maxBatchConcurrency). Consider adding a small synchronized delay/barrier in the handler (e.g., block until N requests are in-flight or sleep for a short duration) so the test reliably exercises the concurrency limit and would fail if the limit were removed.

go.mod Outdated
golang.org/x/crypto v0.31.0 // indirect
golang.org/x/net v0.33.0 // indirect
golang.org/x/sync v0.18.0 // indirect
golang.org/x/sync v0.18.0

Copilot AI Mar 11, 2026


golang.org/x/sync v0.18.0 is required twice (once in the main require block and again in the indirect require block). This duplication is atypical and will be rewritten by go mod tidy; it should appear only once (as a direct dependency).

Suggested change
golang.org/x/sync v0.18.0

Comment on lines 180 to +197
 	embeddings := make([][]float32, len(texts))
+	g, ctx := errgroup.WithContext(ctx)
+	sem := make(chan struct{}, maxBatchConcurrency)
+
 	for i, text := range texts {
-		embedding, err := e.Embed(ctx, text)
-		if err != nil {
-			return nil, fmt.Errorf("failed to embed text %d: %w", i, err)
-		}
-		embeddings[i] = embedding
+		i, text := i, text
+		g.Go(func() error {
+			sem <- struct{}{}
+			defer func() { <-sem }()
+
+			embedding, err := e.Embed(ctx, text)
+			if err != nil {
+				return fmt.Errorf("failed to embed text %d: %w", i, err)
+			}
+			embeddings[i] = embedding
+			return nil
+		})
 	}

Copilot AI Mar 11, 2026


The semaphore is acquired inside the goroutine, so EmbedBatch still spawns one goroutine per input text. For large batches this can create thousands of blocked goroutines and unnecessary memory/GC pressure; the concurrency cap only limits in-flight API calls, not goroutine count. Consider using errgroup.Group.SetLimit(maxBatchConcurrency) (or acquiring the semaphore before starting each goroutine / using a worker pool) so goroutine creation is also bounded.

Comment on lines +186 to +190
		g.Go(func() error {
			sem <- struct{}{}
			defer func() { <-sem }()

			embedding, err := e.Embed(ctx, text)

Copilot AI Mar 11, 2026


Semaphore acquisition (sem <- struct{}{}) does not respect context cancellation. If ctx is cancelled while this send is blocked (because the semaphore is full), the goroutine will remain stuck waiting for capacity, delaying cancellation and potentially causing EmbedBatch to hang until other calls return. Use a select on ctx.Done() when acquiring the semaphore (or rely on errgroup.SetLimit, which avoids this pattern).

Comment on lines +109 to +148
func TestEmbedBatch_ResultsPreserveOrder(t *testing.T) {
	// Track the order in which requests arrive — each call returns a unique embedding
	// so we can verify the output is indexed correctly regardless of goroutine scheduling.
	var counter atomic.Int32
	srv := httptest.NewServer(http.HandlerFunc(func(w http.ResponseWriter, r *http.Request) {
		n := counter.Add(1)
		type embItem struct {
			Embedding []float64 `json:"embedding"`
		}
		type embResp struct {
			Data []embItem `json:"data"`
		}
		b, _ := json.Marshal(embResp{Data: []embItem{{Embedding: []float64{float64(n), 0.0, 0.0}}}})
		w.Header().Set("Content-Type", "application/json")
		w.WriteHeader(200)
		_, _ = w.Write(b)
	}))
	defer srv.Close()

	e := newTestEmbedder(srv.URL)
	e.retryConfig = fastRetry

	texts := make([]string, 20)
	for i := range texts {
		texts[i] = fmt.Sprintf("text-%d", i)
	}

	results, err := e.EmbedBatch(context.Background(), texts)
	if err != nil {
		t.Fatalf("unexpected error: %v", err)
	}
	if len(results) != len(texts) {
		t.Fatalf("expected %d results, got %d", len(texts), len(results))
	}
	for i, emb := range results {
		if len(emb) == 0 {
			t.Errorf("result[%d] is empty", i)
		}
	}
}

Copilot AI Mar 11, 2026


TestEmbedBatch_ResultsPreserveOrder doesn't actually assert order preservation. The handler returns embeddings based on request arrival order (a shared counter), but the test only checks that each result is non-empty, so it would pass even if EmbedBatch returned embeddings in the wrong order. To verify ordering, make the response embedding depend deterministically on the request's input text (e.g., parse the JSON body and encode the input index into the embedding) and assert results[i] matches texts[i].

@coderabbitai

coderabbitai bot commented Mar 11, 2026

📝 Walkthrough

Walkthrough

The PR parallelizes batch embedding by refactoring EmbedBatch to use goroutines and errgroup with a concurrency limit of 10, and promotes golang.org/x/sync v0.18.0 from indirect to an explicit requirement. Tests for ordering, errors, empty input, and concurrency were added.

Changes

  • go.mod (Dependency Management): Added explicit requirement for golang.org/x/sync v0.18.0 (removed it from the indirect block).
  • internal/integrations/ai/embedder.go (Batch Embedding Implementation): Refactored EmbedBatch to run embedding requests in parallel using errgroup with maxBatchConcurrency = 10; preserves input order and aggregates errors.
  • internal/integrations/ai/embedder_test.go (Batch Embedding Tests): Added tests: empty-input validation, result-order preservation, error propagation across batch requests, and concurrency-limit enforcement using httptest handlers and atomic counters.

Sequence Diagram(s)

sequenceDiagram
    autonumber
    participant Client as Caller
    participant Embedder as EmbedBatch
    participant Worker as Goroutine Pool
    participant API as Embedding Service

    Client->>Embedder: Call EmbedBatch([]texts)
    Embedder->>Worker: Spawn up to 10 workers (errgroup)
    loop for each text
        Worker->>API: Send embed request (concurrent)
        API-->>Worker: Return vector or error
        Worker-->>Embedder: Place result at corresponding index
    end
    Note right of Embedder: Wait for errgroup to finish\naggregate errors, preserve order
    Embedder-->>Client: Return ordered results or error

Estimated code review effort

🎯 3 (Moderate) | ⏱️ ~25 minutes

Poem

🐰✨ I hop in parallel, ten at play,
Goroutines hum and speed the day,
Results line up, errors caught in stride,
Embeddings bloom, side-by-side! 🚀

🚥 Pre-merge checks | ✅ 4 | ❌ 1

❌ Failed checks (1 warning)

  • Docstring Coverage ⚠️ Warning: Docstring coverage is 0.00%, below the required threshold of 80.00%. Resolution: write docstrings for the functions missing them.

✅ Passed checks (4 passed)

  • Description Check: Check skipped; CodeRabbit's high-level summary is enabled.
  • Title check: The title accurately captures the main change: parallelizing EmbedBatch using goroutines and errgroup for performance improvement.
  • Linked Issues check: The PR implements the core requirement from #51: refactoring EmbedBatch to use goroutines and errgroup for concurrent embedding requests with controlled concurrency.
  • Out of Scope Changes check: All changes are in scope: embedder.go modifications add parallelization, go.mod manages dependencies, and tests validate the new concurrent behavior.


@Kavirubc
Member Author

@simili-bot Fix the comments added by copilot

@claude

claude bot commented Mar 11, 2026

Claude finished @Kavirubc's task in 2m 42s.


Fixing Copilot Review Comments

  • Read source files
  • Fix: Use errgroup.SetLimit() instead of manual semaphore (unbounded goroutines + context cancellation)
  • Fix: TestEmbedBatch_ResultsPreserveOrder doesn't verify order
  • Fix: TestEmbedBatch_ConcurrencyLimit may be flaky (no delay to force overlap)
  • Fix: Duplicate golang.org/x/sync in go.mod
  • Commit and push

Summary of changes (commit a957c34)

internal/integrations/ai/embedder.go

  • Replaced the manual semaphore channel (sem <- struct{}{}) with g.SetLimit(maxBatchConcurrency). This bounds goroutine creation (not just in-flight API calls), and context cancellation is automatically respected since errgroup won't start new goroutines after the context is done — no select/ctx.Done() dance needed.

internal/integrations/ai/embedder_test.go

  • TestEmbedBatch_ResultsPreserveOrder: Handler now decodes the JSON request body, extracts the index N from the input text ("text-N"), and returns [N+1, 0, 0] as the embedding. The test asserts results[i][0] == float32(i+1), which would fail if any result ended up at the wrong slice index.
  • TestEmbedBatch_ConcurrencyLimit: Added a 5ms sleep in the handler so concurrent requests actually overlap in-flight. Without this, fast local handlers would complete before the next goroutine starts, making the max-observed count always 1 and the test trivially pass even without any limit.
  • Added "time" to imports.

go.mod

  • Removed the duplicate golang.org/x/sync entry from the indirect block, leaving only the direct requirement.
- Replace manual semaphore channel with errgroup.SetLimit(maxBatchConcurrency)
  so goroutine creation is bounded and context cancellation is respected
- Fix TestEmbedBatch_ResultsPreserveOrder to parse input text index from the
  request body and assert deterministic embedding values per position
- Fix TestEmbedBatch_ConcurrencyLimit to add a 5ms handler delay so in-flight
  requests overlap and the concurrency cap is actually observable
- Remove duplicate golang.org/x/sync entry from go.mod indirect block

Co-authored-by: Kaviru Hapuarachchi <Kavirubc@users.noreply.github.com>

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (1)
internal/integrations/ai/embedder.go (1)

184-194: Redundant loop variable capture in Go 1.22+.

Since this project uses Go 1.24.0, the i, text := i, text shadowing on line 185 is no longer necessary. Go 1.22+ creates fresh loop variables per iteration, eliminating the classic closure capture bug.

♻️ Suggested simplification
 	for i, text := range texts {
-		i, text := i, text
 		g.Go(func() error {
 			embedding, err := e.Embed(ctx, text)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/integrations/ai/embedder.go` around lines 184 - 194, The loop in
internal/integrations/ai/embedder.go unnecessarily re-shadows loop variables
with `i, text := i, text`; since Go 1.22+ (we're on 1.24) the loop creates fresh
variables per iteration, so remove the `i, text := i, text` line and let the
goroutine closure use the existing `i` and `text`; keep the body calling
`e.Embed(ctx, text)`, handling `err` and assigning into `embeddings[i]` as
before (inside the `g.Go` function).

ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: af70d79c-f55e-480d-9604-f7820c623136

📥 Commits

Reviewing files that changed from the base of the PR and between a42a602 and a957c34.

📒 Files selected for processing (3)
  • go.mod
  • internal/integrations/ai/embedder.go
  • internal/integrations/ai/embedder_test.go

Comment on lines +122 to +123
var idx int
fmt.Sscanf(req.Input, "text-%d", &idx)


⚠️ Potential issue | 🟡 Minor

Handle fmt.Sscanf error to satisfy linter and prevent silent failures.

Static analysis flags the unchecked return value. If parsing fails, idx remains 0, which could cause misleading test failures.

🛠️ Proposed fix
 		var idx int
-		fmt.Sscanf(req.Input, "text-%d", &idx)
+		if _, err := fmt.Sscanf(req.Input, "text-%d", &idx); err != nil {
+			http.Error(w, "invalid input format", http.StatusBadRequest)
+			return
+		}
🧰 Tools
🪛 GitHub Check: Lint

[failure] 123-123:
Error return value of fmt.Sscanf is not checked (errcheck)

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/integrations/ai/embedder_test.go` around lines 122 - 123, The
fmt.Sscanf call that parses req.Input into idx must have its return values
checked to avoid silent failures; change the call in embedder_test.go to capture
the scan count and error (e.g. n, err := fmt.Sscanf(req.Input, "text-%d",
&idx)), and if err != nil or n != 1 call t.Fatalf or t.Errorf with a clear
message including req.Input and the error so the test fails loudly instead of
leaving idx as 0.

@Kavirubc
Member Author

@simili-bot Review required
At least 1 approving review is required by reviewers with write access.

Some checks were not successful (1 failing, 5 successful):

  • CI / Lint (pull_request): Failing after 37s (Required)
  • CI / Build & Test (pull_request): Successful in 1m (Required)
  • CI / Vet (pull_request): Successful in 25s (Required)
  • CodeRabbit: Review completed
  • DCO: Successful in 1s
  • Simili Triage / triage (pull_request): Successful in 46s

Merging is blocked: at least 1 approving review is required by reviewers with write access.

Lint seems to be failing and CodeRabbit has added another comment as well; fix the above and push to the same PR.

@claude

claude bot commented Mar 11, 2026

Claude finished @Kavirubc's task in 1m 1s.


Fixing Lint Failure & CodeRabbit Comment

  • Read current source files
  • Fix lint error: unchecked fmt.Sscanf return in embedder_test.go:123
  • Fix CodeRabbit nitpick: remove redundant i, text := i, text in embedder.go (Go 1.22+)
  • Commit and push

Changes (commit b6c2825)

internal/integrations/ai/embedder_test.go (line 123)

  • Wrapped fmt.Sscanf in an error check; if parsing fails the handler now returns HTTP 400 instead of silently leaving idx at 0. This satisfies the errcheck linter and prevents misleading test failures.

internal/integrations/ai/embedder.go (line 185)

  • Removed the redundant i, text := i, text loop-variable capture. Since the project targets Go 1.24 (≥ 1.22), the compiler already creates fresh variables per iteration, so the shadowing line was a no-op.

Branch: claude/issue-51-20260311-0959

…apture

- Handle fmt.Sscanf return value in TestEmbedBatch_ResultsPreserveOrder to
  satisfy errcheck linter and fail loudly on malformed input
- Remove i, text := i, text shadowing in EmbedBatch; Go 1.22+ creates fresh
  loop variables per iteration so the capture is unnecessary

Co-authored-by: Kaviru Hapuarachchi <Kavirubc@users.noreply.github.com>

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 1

🧹 Nitpick comments (1)
internal/integrations/ai/embedder.go (1)

22-23: Make the concurrency cap tunable.

A package constant hard-codes the same limit for every provider and deployment. Moving this to Embedder configuration would make rate-limit tuning and perf experiments possible without another code change.
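One possible shape for that configuration, as a sketch only: the Embedder fields, constructor, and functional option below are hypothetical, not the project's actual API.

```go
package main

import "fmt"

const defaultBatchConcurrency = 10

// Embedder sketches the suggested config surface: the cap becomes a field
// rather than a package-level constant.
type Embedder struct {
	MaxBatchConcurrency int
}

// NewEmbedder applies a sensible default and validates overrides so the
// batch path never sees a zero or negative limit.
func NewEmbedder(opts ...func(*Embedder)) *Embedder {
	e := &Embedder{MaxBatchConcurrency: defaultBatchConcurrency}
	for _, opt := range opts {
		opt(e)
	}
	if e.MaxBatchConcurrency <= 0 {
		e.MaxBatchConcurrency = defaultBatchConcurrency
	}
	return e
}

// WithMaxBatchConcurrency lets a caller tune the cap per provider/deployment
// without a code change.
func WithMaxBatchConcurrency(n int) func(*Embedder) {
	return func(e *Embedder) { e.MaxBatchConcurrency = n }
}

func main() {
	fmt.Println(NewEmbedder().MaxBatchConcurrency)                           // 10
	fmt.Println(NewEmbedder(WithMaxBatchConcurrency(4)).MaxBatchConcurrency) // 4
	fmt.Println(NewEmbedder(WithMaxBatchConcurrency(0)).MaxBatchConcurrency) // falls back to 10
}
```

EmbedBatch would then call g.SetLimit(e.MaxBatchConcurrency) instead of reading the constant.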

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/integrations/ai/embedder.go` around lines 22 - 23, The hard-coded
maxBatchConcurrency constant should be made configurable on the Embedder so
different providers/deployments can tune concurrency; add a field (e.g.,
MaxBatchConcurrency int) to the Embedder struct with a sensible default (10)
applied in the Embedder constructor/initializer, remove or deprecate the
package-level const maxBatchConcurrency, and update any uses in EmbedBatch (and
related methods) to read e.MaxBatchConcurrency instead of the constant; ensure
validation (non-zero, positive) when constructing the Embedder.
ℹ️ Review info
⚙️ Run configuration

Configuration used: Organization UI

Review profile: CHILL

Plan: Pro

Run ID: 5de49180-9d9a-4bc9-9500-35ebab699661

📥 Commits

Reviewing files that changed from the base of the PR and between a957c34 and b6c2825.

📒 Files selected for processing (2)
  • internal/integrations/ai/embedder.go
  • internal/integrations/ai/embedder_test.go
🚧 Files skipped from review as they are similar to previous changes (1)
  • internal/integrations/ai/embedder_test.go

Comment on lines +181 to +196
	g, ctx := errgroup.WithContext(ctx)
	g.SetLimit(maxBatchConcurrency)

	for i, text := range texts {
		g.Go(func() error {
			embedding, err := e.Embed(ctx, text)
			if err != nil {
				return fmt.Errorf("failed to embed text %d: %w", i, err)
			}
			embeddings[i] = embedding
			return nil
		})
	}

	if err := g.Wait(); err != nil {
		return nil, err

⚠️ Potential issue | 🟠 Major


Stop queueing new embeds once the errgroup context is canceled.

g.Go() blocks when SetLimit is full and does not watch the derived context. After the first failure or an upstream cancellation, this loop continues walking the rest of texts and enqueuing canceled work, which makes large failing batches return much more slowly than necessary.

Suggested fix
-	g, ctx := errgroup.WithContext(ctx)
+	g, groupCtx := errgroup.WithContext(ctx)
 	g.SetLimit(maxBatchConcurrency)
+	stoppedEarly := false
 
 	for i, text := range texts {
+		if groupCtx.Err() != nil {
+			stoppedEarly = true
+			break
+		}
 		g.Go(func() error {
-			embedding, err := e.Embed(ctx, text)
+			embedding, err := e.Embed(groupCtx, text)
 			if err != nil {
 				return fmt.Errorf("failed to embed text %d: %w", i, err)
 			}
 			embeddings[i] = embedding
 			return nil
 		})
 	}
 
 	if err := g.Wait(); err != nil {
 		return nil, err
 	}
+	if stoppedEarly && ctx.Err() != nil {
+		return nil, ctx.Err()
+	}
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In `@internal/integrations/ai/embedder.go` around lines 181 - 196, Stop enqueuing
new goroutines once the errgroup context is canceled by checking the derived
context before calling g.Go in the loop: after obtaining g, ctx :=
errgroup.WithContext(ctx) and g.SetLimit(maxBatchConcurrency), inside the for i,
text := range texts loop test if ctx.Err() != nil (or select on ctx.Done()) and
break/return early to avoid queuing canceled work; also capture loop variables
(use i, text := i, text inside the loop) so the closure passed to g.Go
references the correct values when invoking e.Embed(ctx, text) and writing
embeddings[i].

@gh-simili-bot
Contributor

Simili Triage Report

Note

Quality Score: 8.8/10 (Good)
The issue could be improved. See suggestions below.

Classification

Category Value
Labels
Quality Improvements
  • Missing baseline performance metrics for current EmbedBatch operation
  • Missing target performance metrics or expected gains after parallelization
  • Include current performance benchmarks for EmbedBatch (e.g., average time per batch, throughput) to establish a baseline.
  • Specify desired performance improvements or expected gains from the parallelization.
Similar Threads
  • 84% 📝 #51 [Performance]: Parallelize Batch Embedding Gene...
  • 82% 📝 #51 [Performance]: Parallelize Batch Embedding Gene... (Open)

Warning

Possible Duplicate (Confidence: 99%)
This pull request might be a duplicate of #51.
Reason: The current issue explicitly states 'Closes #51', indicating it is the direct implementation or resolution of the feature request described in Issue #51. Both issues describe the exact same problem (serial processing in EmbedBatch leading to performance issues) and propose/implement the exact same solution (parallelization using goroutines and errgroup). The root cause and expected outcome are identical, and a single fix fully resolves both.

This pull request will be automatically closed in 72 hours if no objections are raised. If you believe this is not a duplicate, please leave a comment explaining why.


Generated by Simili Bot

@gh-simili-bot
Contributor

Simili Triage Report

Note

Quality Score: 8.8/10 (Good)
The issue could be improved. See suggestions below.

Classification

Category Value
Labels
Quality Improvements
  • No explicit performance benchmarks or target metrics
  • Include expected performance gains or current performance bottlenecks with metrics (e.g., before/after benchmarks) to fully justify the 'perf' label
Similar Threads
  • 84% 📝 #51 [Performance]: Parallelize Batch Embedding Gene...
  • 82% 📝 #51 [Performance]: Parallelize Batch Embedding Gene... (Open)

Warning

Possible Duplicate (Confidence: 99%)
This pull request might be a duplicate of #51.
Reason: The current issue explicitly states 'Closes #51'. Both the current issue and Similar Issue #51 describe the exact same problem: the serial processing of the EmbedBatch function causing performance bottlenecks during bulk indexing. They also propose the identical solution: refactoring EmbedBatch to parallelize embedding requests using goroutines and errgroup. The current issue is the implementation that fully resolves and closes issue #51.

This pull request will be automatically closed in 72 hours if no objections are raised. If you believe this is not a duplicate, please leave a comment explaining why.


Generated by Simili Bot

@gh-simili-bot
Contributor

🧪 E2E Test

Bot responded: yes

| Auto-closer (dry-run) | processed: 0 closed: 0 grace: 0 human: 0 |

Test repo → gh-simili-bot/simili-e2e-23123710729
Run → logs

Auto-generated by E2E pipeline

@Kavirubc Kavirubc merged commit a7e5722 into main Mar 16, 2026
10 checks passed
@Kavirubc Kavirubc deleted the claude/issue-51-20260311-0959 branch March 16, 2026 02:29


Development

Successfully merging this pull request may close these issues.

[Performance]: Parallelize Batch Embedding Generation

3 participants