-
Notifications
You must be signed in to change notification settings - Fork 114
Pull requests: llm-d/llm-d-kv-cache
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
update: add make target to python lint + fix lint
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#536
opened Apr 23, 2026 by
zdtsw
Contributor
Loading…
fix(e2e): build UDS tokenizer image for linux on non-linux hosts
size/S
Denotes a PR that changes 10-29 lines, ignoring generated files.
#535
opened Apr 23, 2026 by
gyliu513
Contributor
Loading…
Add Hybrid Multi-head Attention (HMA) support for KV-Cache scoring
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#533
opened Apr 19, 2026 by
kapiljain1989
Loading…
add group_id tracking for HMA model support
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#532
opened Apr 19, 2026 by
kapiljain1989
Loading…
Added new model registry
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#531
opened Apr 19, 2026 by
kapiljain1989
Loading…
Removed Old Helm Setup
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#530
opened Apr 19, 2026 by
kapiljain1989
Loading…
feat(fs_backend): add performance and stress tests
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#527
opened Apr 16, 2026 by
kfirtoledo
Collaborator
Loading…
2 tasks done
deps(actions): bump softprops/action-gh-release from 2 to 3
dependencies
Pull requests that update a dependency file
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#516
opened Apr 14, 2026 by
dependabot
Bot
Loading…
fix: prevent write queue deadlock under high concurrency
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#512
opened Apr 13, 2026 by
kfirtoledo
Collaborator
Loading…
5 tasks done
Handling Attention Group id in KV events
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#510
opened Apr 10, 2026 by
kapiljain1989
Loading…
fix: register MaxPodHitCount metric in Collectors()
size/M
Denotes a PR that changes 30-99 lines, ignoring generated files.
#509
opened Apr 10, 2026 by
wenhug
Contributor
Loading…
3 tasks done
deps(go): bump go.opentelemetry.io/otel/sdk from 1.39.0 to 1.43.0
dependencies
Pull requests that update a dependency file
size/M
Denotes a PR that changes 30-99 lines, ignoring generated files.
#503
opened Apr 8, 2026 by
dependabot
Bot
Loading…
deps(actions): bump docker/setup-buildx-action from 3 to 4
dependencies
Pull requests that update a dependency file
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#501
opened Apr 7, 2026 by
dependabot
Bot
Loading…
deps(actions): bump docker/build-push-action from 6 to 7
dependencies
Pull requests that update a dependency file
size/XS
Denotes a PR that changes 0-9 lines, ignoring generated files.
#500
opened Apr 7, 2026 by
dependabot
Bot
Loading…
Add object store support to llm-d storage offloading
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#499
opened Apr 6, 2026 by
effi-ofer
Loading…
feat: Add golden test case for multi-modal
lifecycle/stale
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#485
opened Mar 30, 2026 by
gyliu513
Contributor
Loading…
feat: Add HMA support to FS connector
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#476
opened Mar 29, 2026 by
kfirtoledo
Collaborator
•
Draft
4 tasks done
test: Add test and usage example for mm requests
lifecycle/stale
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#453
opened Mar 23, 2026 by
sagearc
Collaborator
Loading…
onboard Github Pages PEP503 compatible simple index for llm-d-fs-offloading connector
lifecycle/rotten
#442
opened Mar 22, 2026 by
Gregory-Pereira
Member
Loading…
deps(go): bump google.golang.org/grpc from 1.77.0 to 1.79.3
dependencies
Pull requests that update a dependency file
lifecycle/rotten
#438
opened Mar 19, 2026 by
dependabot
Bot
Loading…
feat:add support to invalidate KV cache via AllBlocksCleared event
size/XL
Denotes a PR that changes 500-999 lines, ignoring generated files.
#437
opened Mar 18, 2026 by
yash9263
Loading…
deps(go): bump the go-dependencies group across 1 directory with 16 updates
dependencies
Pull requests that update a dependency file
size/L
Denotes a PR that changes 100-499 lines, ignoring generated files.
#430
opened Mar 17, 2026 by
dependabot
Bot
Loading…
feat: Add Hybrid Model Architecture (HMA) Support in Prefix-Cache Aware Scheduling
size/XXL
Denotes a PR that changes 1000+ lines, ignoring generated files.
#427
opened Mar 16, 2026 by
kapiljain1989
Loading…
Previous Next
ProTip!
Updated in the last three days: updated:>2026-04-20.