feat: remove ollama pull feature and add mistralrs local embedding support by starpit · Pull Request #851 · IBM/spnl

starpit · 2026-02-15T17:52:29Z

BREAKING CHANGE: Removed automatic ollama model pulling functionality

This PR removes ollama installation and model pulling from CI/CD workflows and adds support for local embedding models via mistral.rs backend.

Changes

Docker & Infrastructure Cleanup

Deleted docker/Containerfile.hostbuild (14 lines removed)
Deleted docker/Containerfile.hostbuild.ollama (30 lines removed)
Removed ollama installation from docker/gce/vllm scripts
Removed ollama health check from docker/gce/vllm/test.d/spnl-speedup.sh

CI/CD Workflow Changes

Removed ollama installation step from .github/workflows/core.yml
Removed 'pull' feature from cargo test and clippy commands
Simplified to single disk space cleanup step

Embedding Backend Implementation

Added spnl/src/generate/backend/mistralrs/embed.rs with logging support
- New embed() function using mistralrs EmbeddingModelBuilder
- Supports local embedding models with device detection
- Respects MISTRALRS_VERBOSE environment variable for logging
Updated spnl/src/generate/backend/mistralrs/mod.rs to export embed module

Embedding System Refactoring

Moved contentify() helper from openai.rs to embed.rs for shared use
Added support for local/ prefix to use mistralrs backend
Refactored embed() to return Vec<Vec> consistently across backends

CLI Default Model Changes

Changed default generative model from 'ollama/granite3.3:2b' to 'llama3.2:1b'
Changed default embedding model from 'ollama/mxbai-embed-large:335m' to 'local/google/embeddinggemma-300m'

Cargo Configuration

Added 'local' feature to default features list
Removed unused tokio-util dependency (not referenced in any features or code)

Test Updates

Updated optimizer::hlo::tests::retrieve to use new default embedding model

Documentation

Removed 'pull' feature from docs/feature-flags.md
Regenerated complete feature-flags.md with all current features organized by category

Net Impact

23 files changed: +217 insertions, -450 deletions
Net reduction of 233 lines

Users will need to manually pull ollama models before use if using ollama backend.

Made with Bob

…pport BREAKING CHANGE: Removed automatic ollama model pulling functionality This commit removes ollama installation and model pulling from CI/CD workflows and adds support for local embedding models via mistral.rs backend. Docker & Infrastructure Cleanup: - Deleted docker/Containerfile.hostbuild (14 lines removed) - Deleted docker/Containerfile.hostbuild.ollama (30 lines removed) - Removed ollama installation from docker/gce/vllm/setup.sh - Removed ollama installation from docker/gce/vllm/setup-dev.sh - Removed ollama installation from docker/gce/vllm/create-vllm-gce-image.sh - Removed ollama health check from docker/gce/vllm/test.d/spnl-speedup.sh CI/CD Workflow Changes (.github/workflows/core.yml): - Removed ollama installation step (Linux curl install, macOS brew install) - Removed ollama systemd service startup logic - Simplified to single disk space cleanup step - Reduced workflow complexity by 12 lines Embedding Backend Implementation: - Added spnl/src/generate/backend/mistralrs/embed.rs (37 lines) * New embed() function using mistralrs EmbeddingModelBuilder * Supports local embedding models with device detection * Converts EmbedData to text strings and generates embeddings - Updated spnl/src/generate/backend/mistralrs/mod.rs to export embed module Embedding System Refactoring (spnl/src/augment/embed.rs): - Moved contentify() helper from openai.rs to embed.rs for shared use - Added support for local/ prefix to use mistralrs backend - Refactored embed() to return Vec<Vec<f32>> consistently across backends - Changed all backend calls to collect() results into Vec for uniform handling - Improved error handling with proper Result propagation OpenAI Backend Cleanup (spnl/src/generate/backend/openai.rs): - Removed duplicate contentify() function (18 lines) - Updated embed() to use shared contentify() from augment::embed module - Simplified code by removing redundant helper function CLI Default Model Changes (cli/src/args.rs): - Changed default generative model from 'ollama/granite3.3:2b' to 'llama3.2:1b' - Changed default embedding model from 'ollama/mxbai-embed-large:335m' to 'local/google/embeddinggemma-300m' - Updated to use local embedding models by default Cargo Configuration (spnl/Cargo.toml): - Added 'local' feature to default features list - Enables mistralrs local embedding support by default - Removed unused tokio-util dependency (not referenced in any features or code) Index & RAG Updates: - Updated spnl/src/augment/index/mod.rs to use new embed signature - Removed pull_if_needed call from spnl/src/augment/index/raptor.rs Build System: - Updated .github/scripts/free-up-disk-space-fast.sh with additional cleanup Net change: -124 lines, +88 lines (36 lines removed overall) Users will need to manually pull ollama models before use if using ollama backend. Signed-off-by: Nick Mitchell <nickm@us.ibm.com>

starpit added the made with bob PR created with assistance from Bob AI label Feb 15, 2026

starpit force-pushed the remove-ollama-pull-feature branch 6 times, most recently from 583baf8 to 1667d1e Compare February 17, 2026 18:45

starpit force-pushed the remove-ollama-pull-feature branch from 1667d1e to 75e6414 Compare February 17, 2026 21:20

starpit mentioned this pull request Feb 18, 2026

feat(mistralrs): add local embedding support #875

Merged

2 tasks

starpit closed this Feb 18, 2026

starpit deleted the remove-ollama-pull-feature branch February 18, 2026 15:58

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: remove ollama pull feature and add mistralrs local embedding support#851

feat: remove ollama pull feature and add mistralrs local embedding support#851
starpit wants to merge 1 commit into
mainfrom
remove-ollama-pull-feature

starpit commented Feb 15, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

starpit commented Feb 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes

Docker & Infrastructure Cleanup

CI/CD Workflow Changes

Embedding Backend Implementation

Embedding System Refactoring

CLI Default Model Changes

Cargo Configuration

Test Updates

Documentation

Net Impact

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

starpit commented Feb 15, 2026 •

edited

Loading