Skip to content

Add QuantizeEmbeddingInt8 and ShareEmbeddingLmHead graph surgeries for INT8 embedding quantization#2464

Open
apsonawane wants to merge 4 commits into
mainfrom
asonawane/tieword
Open

Add QuantizeEmbeddingInt8 and ShareEmbeddingLmHead graph surgeries for INT8 embedding quantization#2464
apsonawane wants to merge 4 commits into
mainfrom
asonawane/tieword

Commits

Commits on Apr 15, 2026