Bug: Local LLM setup fails — missing env loading and vector dimension mismatch

<html><head></head><body><h2>Bug Report</h2><p>When following the <strong><code inline="">SETUP.md</code></strong> instructions for local development using <strong>Ollama (Section 4)</strong>, the backend fails to start and the knowledge base upload endpoint returns errors.</p><hr><h2>Bug 1: Backend fails to start with <code inline="">Field required</code> errors</h2><h3>Steps to Reproduce</h3><ol><li><p>Follow <code inline="">SETUP.md</code> to configure <code inline="">backend/.env</code> with <code inline="">PROVIDER=local</code> and Ollama settings.</p></li><li><p>Run the backend directly:</p></li></ol><pre><code class="language-bash">poetry run uvicorn app.main:app --reload --host 0.0.0.0 --port 8000
</code></pre><ol start="3"><li><p>Observe all settings fields reported as missing.</p></li></ol><h3>Expected Behavior</h3><p>The server should start successfully, reading configuration from <code inline="">backend/.env</code>.</p><h3>Actual Behavior</h3><pre><code>AZURE_DEPLOYMENT_NAME
  Field required [type=missing, input_value={}, input_type=dict]

AZURE_EMBEDDING_DEPLOYMENT_NAME
  Field required [type=missing, input_value={}, input_type=dict]

QDRANT_URL
  Field required [type=missing, input_value={}, input_type=dict]
</code></pre><h3>Root Cause</h3><p><code inline="">Settings(BaseSettings)</code> in <code inline="">config.py</code> does not configure <code inline="">env_file</code>, so <strong>Pydantic only reads from OS environment variables</strong>, not from the <code inline="">.env</code> file.</p><p>This works with <strong>Docker Compose</strong> (which injects env vars via <code inline="">env_file:</code> in <code inline="">docker-compose.yml</code>) but fails when running the backend directly.</p><hr><h2>Bug 2: Knowledge base PDF upload fails with vector dimension mismatch</h2><h3>Steps to Reproduce</h3><ol><li><p>Configure <code inline="">PROVIDER=local</code> with<br><code inline="">AZURE_EMBEDDING_DEPLOYMENT_NAME=nomic-embed-text</code> (as recommended in <code inline="">SETUP.md</code>)</p></li><li><p>Start Qdrant and the backend.</p></li><li><p>Upload a PDF via:</p></li></ol><pre><code>POST /kb/upload-pdf
</code></pre><h3>Expected Behavior</h3><p>The PDF should be <strong>chunked, embedded, and stored in Qdrant successfully</strong>.</p><h3>Actual Behavior</h3><pre><code class="language-json">{
  "status": "error",
  "message": "Error uploading PDF file: Unexpected Response: 400 (Bad Request)\nRaw response content:\nb'{\"status\":{\"error\":\"Wrong input: Vector dimension error: expected dim: 1536, got 768\"}}'"
}
</code></pre><h3>Root Cause</h3><p><code inline="">create_knowledge_base_collection_if_not_exists()</code> in <code inline="">knowledge_base_service.py</code> <strong>hardcodes</strong></p><pre><code class="language-python">vector_size = 1536
</code></pre><p>This matches Azure OpenAI’s <code inline="">text-embedding-ada-002</code>, but <strong><code inline="">nomic-embed-text</code> produces 768-dimensional vectors</strong>, which causes the mismatch.</p><hr><h2> Additional: <code inline="">qdrant_storage/</code> not ignored in git</h2><p>When running Qdrant with the Docker command from <code inline="">SETUP.md</code> : the generated <code inline="">.json</code>, <code inline="">.dat</code>, and <code inline="">.mmap</code> storage files are <strong>not ignored</strong> and can accidentally be committed.</p><hr><h2>Proposed Fix</h2>
File | Change
-- | --
backend/app/core/config.py | Add SettingsConfigDict(env_file=".env") to the Settings class
backend/app/services/knowledge_base_service.py | Set vector_size dynamically: 768 for local, 1536 for Azure
.gitignore | Add qdrant_storage/

<hr><h2>Additional Context</h2><p>These issues appear only when running the backend <strong>locally with Ollama</strong>, while the Docker-based setup works correctly because environment variables are injected through <code inline="">docker-compose.yml</code>.</p></body></html>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Bug: Local LLM setup fails — missing env loading and vector dimension mismatch #43

Bug Report

Bug 1: Backend fails to start with `Field required` errors

Steps to Reproduce

Expected Behavior

Actual Behavior

Root Cause

Bug 2: Knowledge base PDF upload fails with vector dimension mismatch

Steps to Reproduce

Expected Behavior

Actual Behavior

Root Cause

Additional: `qdrant_storage/` not ignored in git

Proposed Fix

Additional Context

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

File	Change
backend/app/core/config.py	Add SettingsConfigDict(env_file=".env") to the Settings class
backend/app/services/knowledge_base_service.py	Set vector_size dynamically: 768 for local, 1536 for Azure
.gitignore	Add qdrant_storage/

Bug: Local LLM setup fails — missing env loading and vector dimension mismatch #43

Description

Bug Report

Bug 1: Backend fails to start with Field required errors

Steps to Reproduce

Expected Behavior

Actual Behavior

Root Cause

Bug 2: Knowledge base PDF upload fails with vector dimension mismatch

Steps to Reproduce

Expected Behavior

Actual Behavior

Root Cause

Additional: qdrant_storage/ not ignored in git

Proposed Fix

Additional Context

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions

Bug 1: Backend fails to start with `Field required` errors

Additional: `qdrant_storage/` not ignored in git