Commit 0450f12
Fix vLLM integration: use register_model_loader API, subclass BaseModelLoader
- Use vLLM v0.17.0's register_model_loader decorator
- Subclass BaseModelLoader from base_loader module
- Fallback for older vLLM versions (registry dict)
- Dynamic base class to avoid import failure when vLLM not installed
- Add GPU integration test (verified on RTX 4090: 2.9x speedup)
Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
1 parent 37affb5 commit 0450f12
3 files changed
Lines changed: 301 additions & 46 deletions