Importance weight based sparse attention implementation for auto-regressive decoding. #2

Open
FengDSP wants to merge 3 commits into main from important_kv_cache

Commits

Commits on Oct 24, 2023