Commit 2348268
committed
fix(ci): pin onnxruntime>=1.20.0 to ensure INT4 quantization support
- Specify minimum version 1.20.0 which has stable matmul_4bits_quantizer
- Install optimum without [onnxruntime] extra to avoid version downgrades
- Add package version diagnostics for debugging
- Fail fast if INT4 quantization is not available (instead of silent fallback)1 parent 757d3ed commit 2348268
1 file changed
+6
-3
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
163 | 163 | | |
164 | 164 | | |
165 | 165 | | |
166 | | - | |
167 | | - | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
168 | 171 | | |
169 | | - | |
| 172 | + | |
170 | 173 | | |
171 | 174 | | |
172 | 175 | | |
| |||
0 commit comments