Skip to content

chore: bump to llmd 0.5.0#855

Merged
starpit merged 3 commits into
mainfrom
llmd-050
Feb 17, 2026
Merged

chore: bump to llmd 0.5.0#855
starpit merged 3 commits into
mainfrom
llmd-050

Conversation

@starpit
Copy link
Copy Markdown
Member

@starpit starpit commented Feb 16, 2026

Changes

  • Bump to llmd 0.5.0
  • Increase dev mode GCE disk to 300GB pd-balanced for compilation
  • Keep production mode at 100GB pd-ssd for optimal serving performance

Problem

The previous PR run failed with exit code 128 during cloud-init setup. Analysis of the logs showed the VM was compiling Rust dependencies but ran out of disk space, causing the build to fail and the instance to shut down prematurely.

Solution

This PR increases the disk size for dev mode (CI/testing) from 100GB to 300GB and switches to pd-balanced disk type for cost efficiency. Production mode remains unchanged at 100GB pd-ssd for optimal serving performance.

The disk configuration now adapts based on whether SPNL_GITHUB is set:

  • Dev mode: 300GB pd-balanced (for compilation workloads)
  • Production mode: 100GB pd-ssd (for serving workloads)

This should resolve the disk space issues during Rust compilation of SPNL and its many dependencies (geodatafusion, lance-datafusion, tantivy, etc.).

@starpit starpit force-pushed the llmd-050 branch 11 times, most recently from 021e234 to a07df0d Compare February 17, 2026 16:32
- Increase dev mode GCE disk to 300GB pd-balanced for compilation
- Keep production mode at 100GB pd-ssd for optimal serving performance
- Fixes exit code 128 failures due to insufficient disk space during Rust compilation

Signed-off-by: Nick Mitchell <nickm@us.ibm.com>
… compatibility

- Add vllm_precompiled_wheel_commit field to GceConfig with default value from llm-d
- Update setup-dev.sh to use VLLM_PRECOMPILED_WHEEL_COMMIT for wheel lookup
- Propagate parameter through cloud-config.yaml and up.rs substitutions
- Fixes ImportError: undefined symbol _ZN3c104cuda29c10_cuda_check_implementationEiPKcS2_jb
- Ensures precompiled wheels match PyTorch/CUDA environment ABI

Signed-off-by: Nick Mitchell <nickm@us.ibm.com>
@starpit starpit force-pushed the llmd-050 branch 2 times, most recently from a9611d6 to 862bc76 Compare February 17, 2026 18:34
Signed-off-by: Nick Mitchell <nickm@us.ibm.com>
@starpit starpit merged commit b836b18 into main Feb 17, 2026
36 checks passed
@starpit starpit deleted the llmd-050 branch February 17, 2026 19:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant