Commit 7ad0f37
committed
feat(llama-quant): Allow F16 and BF16 quants of ssm_conv1d.weight
This is experimantal!
Branch: Mamba2SSD
Signed-off-by: Gabe Goodhart <ghart@us.ibm.com>1 parent 82bba1d commit 7ad0f37
1 file changed
+13
-2
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
421 | 421 | | |
422 | 422 | | |
423 | 423 | | |
| 424 | + | |
| 425 | + | |
| 426 | + | |
| 427 | + | |
| 428 | + | |
| 429 | + | |
| 430 | + | |
| 431 | + | |
| 432 | + | |
| 433 | + | |
| 434 | + | |
| 435 | + | |
424 | 436 | | |
425 | 437 | | |
426 | 438 | | |
| |||
859 | 871 | | |
860 | 872 | | |
861 | 873 | | |
862 | | - | |
| 874 | + | |
863 | 875 | | |
864 | | - | |
865 | 876 | | |
866 | 877 | | |
867 | 878 | | |
| |||
0 commit comments