Importance weight based sparse attention implementation for auto-regressive decoding.#2
Open
Importance weight based sparse attention implementation for auto-regressive decoding.#2
Commits
Commits on Oct 24, 2023
- committed
Feng Li - committed
Feng Li - committed
Feng Li