
Introduce Provider-Agnostic Local-First Execution (Ollama Backend) with Runtime Decoupling and Performance Hardening#145

Open
spice14 wants to merge 22 commits into VectifyAI:main from spice14:main

Conversation


@spice14 spice14 commented Mar 4, 2026

This pull request introduces a provider-agnostic, local-first execution mode for PageIndex, enabling fully offline document indexing and reasoning using Ollama as the default LLM backend.

This is not a simple endpoint swap. The PR introduces runtime abstraction, response normalization, prompt governance, and performance hardening to make local execution stable, reproducible, and production-viable.

The core PageIndex tree-based, vectorless reasoning architecture is preserved. The inference layer and operational surface are restructured to support pluggable providers, with Ollama as the primary local backend.

Key Enhancements

  1. Provider Runtime Decoupling
  • Replaced OpenAI-tied wrapper calls with provider-routed interfaces
  • Centralized provider selection into configuration instead of business logic
  • Preserved compatibility for optional future providers
  • This creates a clean inference boundary and prevents provider assumptions from leaking into traversal/indexing flows.
  2. Finish-Reason Normalization
  • Introduced a response handling layer to standardize continuation behavior
  • Normalized provider-specific stop/truncation semantics
  • Reduced brittle assumptions in recursive tree traversal flows
  • This improves determinism across different model backends.
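One plausible shape for such a normalization layer, assuming a small canonical enum and a provider-to-canonical mapping (the names and the raw finish strings below are illustrative, not taken from the PR):

```python
from enum import Enum

class FinishReason(Enum):
    STOP = "stop"            # model finished naturally
    TRUNCATED = "truncated"  # hit a token limit; caller should continue
    ERROR = "error"          # anything unrecognized is treated as an error

# Provider-specific finish strings collapsed to canonical reasons (illustrative).
_FINISH_MAP = {
    ("openai", "stop"):   FinishReason.STOP,
    ("openai", "length"): FinishReason.TRUNCATED,
    ("ollama", "stop"):   FinishReason.STOP,
    ("ollama", "length"): FinishReason.TRUNCATED,
}

def normalize_finish(provider: str, raw_reason: str) -> FinishReason:
    """Map a provider's raw finish/stop reason onto one canonical enum,
    so recursive traversal code branches on FinishReason, never on
    provider-specific strings."""
    return _FINISH_MAP.get((provider, raw_reason), FinishReason.ERROR)
```

Traversal code then decides "continue generating or stop" from `FinishReason` alone, which is what makes behavior consistent across backends.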
  3. Local-First CLI &amp; Operational Shift
  • Added CLI (cli.py) aligned with local inference defaults
  • Ollama integrated as first-class backend (mistral-based defaults, configurable)
  • Removed OpenAI dependency from default local workflow
  • Setup scripts for local model provisioning
  • No API keys required for default usage.
  4. Prompt Governance Refactor
  • Externalized prompts into a registry-driven loader system
  • Removed large inline prompt strings from core logic
  • Enables easier cross-model tuning and reproducibility
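A registry-driven loader can be as small as the sketch below; the directory layout, default-prompt dict, and function names are assumptions for illustration, with on-disk files overriding in-code defaults:

```python
from pathlib import Path

PROMPT_DIR = Path("prompts")  # assumed layout: prompts/<name>.txt

# In-code defaults; external files under prompts/ override these (illustrative).
_DEFAULT_PROMPTS = {
    "toc_detect": "List the table-of-contents entries in:\n{document}",
    "summarize": "Summarize this section:\n{document}",
}

def load_prompt(name: str) -> str:
    """Resolve a named prompt template: file on disk wins, else the default."""
    path = PROMPT_DIR / f"{name}.txt"
    if path.exists():
        return path.read_text(encoding="utf-8")
    return _DEFAULT_PROMPTS[name]

def render_prompt(name: str, **fields: str) -> str:
    """Fill {placeholder} fields so core logic never embeds prompt strings."""
    return load_prompt(name).format(**fields)
```

Because prompts are resolved by name, tuning a prompt for a different model is a file edit, not a code change.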
  5. Bounded Async Concurrency
  • Introduced semaphore-controlled parallelization in TOC detection and summarization flows
  • Improves throughput for slower local models
  • Maintains deterministic behavior
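Semaphore-bounded fan-out in asyncio typically looks like the following sketch; `summarize_node`, `bounded_gather`, and the default limit are illustrative stand-ins, not the PR's code:

```python
import asyncio

async def summarize_node(node_id: int) -> str:
    """Stand-in for a per-node LLM summarization call."""
    await asyncio.sleep(0)  # placeholder for real inference latency
    return f"summary-{node_id}"

async def bounded_gather(node_ids, limit: int = 4):
    """Summarize nodes concurrently, but never more than `limit` at once."""
    sem = asyncio.Semaphore(limit)

    async def run(nid: int) -> str:
        async with sem:  # blocks when `limit` tasks are already in flight
            return await summarize_node(nid)

    # gather() returns results in input order, which keeps output deterministic
    # even though completion order varies.
    return await asyncio.gather(*(run(n) for n in node_ids))

results = asyncio.run(bounded_gather(range(3)))
```

The bound matters more for local backends than hosted ones: a slow local model saturates quickly, and an unbounded fan-out just queues work inside the server.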
  6. Adaptive Fallback &amp; Chunking Improvements
  • Hardened no-TOC and hierarchical fallback paths
  • Improved resilience under degraded-model scenarios
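The no-TOC path presumably falls back to structure-free chunking; a minimal sketch of that idea (the function name, page-based input, and size bound are all assumptions, not the PR's implementation):

```python
def chunk_without_toc(pages: list[str], max_chars: int = 2000) -> list[str]:
    """Fallback: when no TOC is detected, group consecutive pages into
    bounded-size chunks instead of TOC-derived sections."""
    chunks: list[str] = []
    current = ""
    for page in pages:
        # Start a new chunk when adding this page would exceed the bound.
        if current and len(current) + len(page) > max_chars:
            chunks.append(current)
            current = ""
        current += page
    if current:
        chunks.append(current)
    return chunks
```

A degraded model that fails TOC extraction then still yields indexable units, at the cost of less meaningful section boundaries.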
  7. Expanded Validation Surface
  • Added e2e and integration tests
  • Added parallel-processing validation scripts
  • Verified tree generation, node selection, and answer generation using Ollama
  • This reduces regression risk introduced by provider variability.

Why This Matters

  • Upstream PageIndex is a powerful tree-based, vectorless reasoning framework for long documents.
  • This PR expands its footprint by:
    • Enabling fully offline execution
    • Reducing cloud/API coupling
    • Improving runtime abstraction boundaries
    • Increasing robustness across model providers
    • Supporting reproducible local experimentation
  • It moves PageIndex from an OpenAI-assumed execution model to a viable multi-provider, local-first architecture.

What’s Preserved

  • Core tree construction and reasoning logic
  • Indexing pipeline
  • High-level API contract and traversal design
  • No changes were made to the fundamental reasoning strategy.

Caveats

  • Smaller local models (e.g., ~3B) may struggle with complex reasoning compared to large hosted models.
  • Performance depends on local hardware (RAM/VRAM).
  • Upstream provider SDK code remains for compatibility; this PR reorients defaults but does not remove multi-provider flexibility.

Testing & Verification

  • Verified end-to-end CLI workflows with Ollama
  • Added automated tests covering:
    • Tree generation
    • Node selection
    • Answer generation
    • Parallel execution flows
  • Updated documentation for local setup and configuration

@spice14 spice14 changed the title Add local-first support with Ollama backend for PageIndex CLI and workflows Introduce Provider-Agnostic Local-First Execution (Ollama Backend) with Runtime Decoupling and Performance Hardening Mar 4, 2026