Pull requests: 1CatAI/1Cat-vLLM
[Bugfix] Default Qwen3 reasoning parser to prompt-has-open-think
#52
opened May 27, 2026 by
rivetphilbot
2 of 3 tasks
Bump actions/stale from 10.1.1 to 10.3.0
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#51
opened May 27, 2026 by
dependabot
Bot
[Bugfix] Allow fp8_e5m2 KV cache on W4A16 compressed-tensors models (V100/SM70)
#49
opened May 22, 2026 by
rivetphilbot
3 of 4 tasks
fix: keep partial content when reasoning block is truncated by max_tokens
#47
opened May 19, 2026 by
rivetphilbot
[V100/SM70] Add compressed-tensors dense WNA16 path + DeltaNet weight loading
#45
opened May 19, 2026 by
rivetphilbot
6 tasks done
Fix SM70 docker build, CUDA 13.0 compat conflict, and README docker args
#43
opened May 17, 2026 by
titidatiti
Draft
Fix load_weights for FP16 Qwen3.5-MoE checkpoints with pre-fused expert tensors
#21
opened Apr 21, 2026 by
daelsc
Bump actions/github-script from 8.0.0 to 9.0.0
dependencies
Pull requests that update a dependency file
github_actions
Pull requests that update GitHub Actions code
#19
opened Apr 14, 2026 by
dependabot
Bot