Complete feature: embeddings batching, retries, managed identity #1114

dluc · 2025-12-18T16:54:24Z

Complete Embedding Generators and Cache feature by implementing remaining Must-Have acceptance criteria.

Changes:

Add configurable embeddings batch sizing (default 10) via EmbeddingsConfig.batchSize and chunking for batch-capable providers.
Add shared transient HTTP retry/backoff (honors Retry-After) and use it in OpenAI/Azure/HF/Ollama providers.
Add Azure OpenAI managed identity auth (DefaultAzureCredential) alongside API key auth.
Add HuggingFace HF_TOKEN environment variable fallback.
Add/adjust unit + integration tests and stabilize env-var dependent tests.

- Add EmbeddingsConfig.batchSize + validate provider configs - Implement shared HttpRetryPolicy (Retry-After + exponential backoff) and use it in OpenAI/Azure/HF/Ollama - Support Azure OpenAI managed identity via DefaultAzureCredential - Support HuggingFace HF_TOKEN fallback - Update docs + expand unit/integration tests; stabilize env-var tests

Restore the original OLLAMA_AVAILABLE gating and remove the unrequested FTS-only config change.

dluc · 2025-12-18T17:06:52Z

Restored to upstream behavior (reverted the FTS-only config + removed the accidental change to Ollama gating).

dluc · 2025-12-18T17:07:03Z

Restored tests/Main.Tests/Integration/SearchProcessTests.cs to upstream behavior (reverted the FTS-only config change and restored the original Ollama/OLLAMA_AVAILABLE gating).

- Add per-attempt timeout support and Ollama-specific timeout - Avoid retrying connection-refused/host-not-found to prevent long-running test hangs

dluc · 2025-12-18T17:29:00Z

Pushed a fix for the retry regression: HttpRetryPolicy now fails fast on connection-refused/host-not-found/network-unreachable and enforces per-attempt timeouts (Ollama uses a low timeout).

This prevents coverage.sh/test runs from appearing hung when Ollama is stopped.

Copilot

Pull request overview

This PR completes the embeddings feature by implementing batching with configurable batch sizes, adding HTTP retry/backoff logic for transient failures, supporting Azure OpenAI managed identity authentication, and adding HuggingFace HF_TOKEN environment variable fallback support. The changes include comprehensive test coverage and validation.

Key Changes

Embeddings Batching: Adds configurable BatchSize property (default: 10) to all embeddings configs and implements chunking logic in batch-capable providers (OpenAI, HuggingFace, Azure OpenAI)
HTTP Retry Policy: Implements a shared retry/backoff mechanism with exponential backoff, Retry-After header support, and per-attempt timeouts for all embedding providers
Azure Managed Identity: Adds support for DefaultAzureCredential authentication in Azure OpenAI alongside existing API key auth

Reviewed changes

Copilot reviewed 23 out of 23 changed files in this pull request and generated 8 comments.

Show a summary per file

File	Description
tests/Main.Tests/Services/EmbeddingGeneratorFactoryTests.cs	Adds test for HF_TOKEN environment variable fallback
tests/Core.Tests/TestCollections.cs	Defines non-parallel test collection for environment variable tests
tests/Core.Tests/Logging/SensitiveDataScrubbingPolicyTests.cs	Adds collection attribute for thread-safe environment variable tests
tests/Core.Tests/Logging/EnvironmentDetectorTests.cs	Adds collection attribute for thread-safe environment variable tests
tests/Core.Tests/Http/HttpRetryPolicyTests.cs	Adds basic retry policy tests for 429 responses
tests/Core.Tests/Embeddings/Providers/OpenAIEmbeddingGeneratorTests.cs	Updates tests with batchSize parameter and adds chunking test
tests/Core.Tests/Embeddings/Providers/OllamaEmbeddingGeneratorTests.cs	Adds delayAsync parameter for fast unit tests
tests/Core.Tests/Embeddings/Providers/HuggingFaceEmbeddingGeneratorTests.cs	Updates tests with batchSize parameter
tests/Core.Tests/Embeddings/Providers/AzureOpenAIEmbeddingGeneratorTests.cs	Adds managed identity authentication tests with TestTokenCredential
src/Main/Services/EmbeddingGeneratorFactory.cs	Updates factory to pass batchSize and handle HF_TOKEN fallback
src/Core/Http/HttpRetryPolicy.cs	Implements new shared retry policy with exponential backoff and Retry-After support
src/Core/Embeddings/Providers/OpenAIEmbeddingGenerator.cs	Adds batching with chunking and integrates retry policy
src/Core/Embeddings/Providers/OllamaEmbeddingGenerator.cs	Integrates retry policy (no batching as Ollama doesn't support it)
src/Core/Embeddings/Providers/HuggingFaceEmbeddingGenerator.cs	Adds batching with chunking and integrates retry policy
src/Core/Embeddings/Providers/AzureOpenAIEmbeddingGenerator.cs	Adds managed identity support, batching, and retry policy integration
src/Core/Core.csproj	Adds Azure.Identity package reference
src/Core/Constants.cs	Adds HttpRetryDefaults constants for retry behavior
src/Core/Config/Embeddings/OpenAIEmbeddingsConfig.cs	Adds BatchSize validation
src/Core/Config/Embeddings/OllamaEmbeddingsConfig.cs	Adds BatchSize validation
src/Core/Config/Embeddings/HuggingFaceEmbeddingsConfig.cs	Updates validation to allow HF_TOKEN env var and adds BatchSize validation
src/Core/Config/Embeddings/EmbeddingsConfig.cs	Adds BatchSize property to base config
src/Core/Config/Embeddings/AzureOpenAIEmbeddingsConfig.cs	Adds BatchSize validation
src/Core/Config/AppConfig.cs	Adds pragma to suppress CA1724 warning from Azure.Identity namespace conflict

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

src/Core/Embeddings/Providers/OpenAIEmbeddingGenerator.cs

src/Core/Http/HttpRetryPolicy.cs

src/Core/Embeddings/Providers/AzureOpenAIEmbeddingGenerator.cs

src/Main/Services/EmbeddingGeneratorFactory.cs

tests/Core.Tests/Http/HttpRetryPolicyTests.cs

src/Core/Http/HttpRetryPolicy.cs

tests/Core.Tests/Embeddings/Providers/AzureOpenAIEmbeddingGeneratorTests.cs

Resolve HF_TOKEN into HuggingFaceEmbeddingsConfig.ApiKey during validation so factories/providers rely only on config.

Add comment explaining the final throw is unreachable in normal flow and kept as defensive code.

dluc changed the title ~~Complete feature 00007: embeddings batching, retries, managed identity~~ Complete feature: embeddings batching, retries, managed identity Dec 18, 2025

Revert SearchProcessTests to Ollama-gated behavior

d341f7f

Restore the original OLLAMA_AVAILABLE gating and remove the unrequested FTS-only config change.

Fix HttpRetryPolicy: fail fast on unreachable services

e004fec

- Add per-attempt timeout support and Ollama-specific timeout - Avoid retrying connection-refused/host-not-found to prevent long-running test hangs

dluc requested a review from Copilot December 18, 2025 17:33

Copilot started reviewing on behalf of dluc December 18, 2025 17:34 View session

Copilot AI reviewed Dec 18, 2025

View reviewed changes

dluc added 2 commits December 18, 2025 18:47

Centralize HF_TOKEN handling in config

9a6049f

Resolve HF_TOKEN into HuggingFaceEmbeddingsConfig.ApiKey during validation so factories/providers rely only on config.

Docs: clarify defensive fallback in HttpRetryPolicy

d2b36e2

Add comment explaining the final throw is unreachable in normal flow and kept as defensive code.

dluc merged commit 94b69d3 into microsoft:main Dec 18, 2025
3 checks passed

dluc deleted the feat/00007-complete branch December 18, 2025 17:56

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Complete feature: embeddings batching, retries, managed identity #1114

Complete feature: embeddings batching, retries, managed identity #1114

Uh oh!

dluc commented Dec 18, 2025 •

edited

Loading

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Complete feature: embeddings batching, retries, managed identity #1114

Complete feature: embeddings batching, retries, managed identity #1114

Uh oh!

Conversation

dluc commented Dec 18, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

dluc commented Dec 18, 2025

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Key Changes

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

dluc commented Dec 18, 2025 •

edited

Loading