feat(local): add model pool unload to release GPU memory by starpit · Pull Request #903 · IBM/spnl

starpit · 2026-02-23T21:00:19Z

Summary

Adds ModelPool::unload_all() to clear all cached models and release GPU memory
Exposes as spnl::model_pool::unload_all() (behind local feature gate)
Fixes OOM when running multiple models sequentially (e.g. multi-model PIC benchmarks) — previously loaded models accumulated in VRAM

Test plan

Run bench pic --full (multi-model sweep) — verify models unload between runs and no OOM
Run single-model bench — verify no regression (model still cached within a run)
cargo check --features bench,metal

🤖 Generated with Claude Code

When running multiple models sequentially (e.g. multi-model benchmarks), previously loaded models remained in VRAM causing OOM errors. Adds ModelPool::unload_all() and exposes it as spnl::model_pool::unload_all() so callers can free GPU memory between model runs. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com> Signed-off-by: Nick Mitchell <nickm@us.ibm.com>

starpit added the made with opus4.6 label Feb 23, 2026

starpit merged commit 7757498 into IBM:main Feb 23, 2026
36 checks passed

starpit deleted the feat/model-pool-unload branch February 23, 2026 22:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(local): add model pool unload to release GPU memory#903

feat(local): add model pool unload to release GPU memory#903
starpit merged 1 commit into
IBM:mainfrom
starpit:feat/model-pool-unload

starpit commented Feb 23, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

starpit commented Feb 23, 2026

Summary

Test plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant