fix gptq quantization condition #2416
Conversation
Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
@jiqing-feng @SunMarc Looks good to me for the short term. Long term, since we lack information, we really don't know whether an object that inherits from nn.Linear but is not exactly nn.Linear is truly non-quantizable. For example, a module that subclasses nn.Linear and only wraps forward, where the wrapper just moves tensors from disk to GPU before the forward pass and then moves them back to disk afterwards, would be black-listed by this logic even though it is actually eligible for quantization. As in my comment in huggingface/transformers#44588 (comment), in the future we need much more information to decide better. Given the current lack of information, any decision we make will be incomplete and will either target too widely or too narrowly.
Same as huggingface/transformers#44588. Quantization only works for the original nn.Linear module; a subclass has a custom forward, so the quantized layer cannot handle it.
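The distinction the fix relies on can be illustrated with a minimal sketch. The class names below are stand-ins, not the actual transformers or GPTQ code: `Linear` plays the role of `nn.Linear`, and an exact-type check excludes subclasses with a custom forward, whereas `isinstance` would not.

```python
# Illustrative sketch only: Linear stands in for torch.nn.Linear,
# CustomLinear for a subclass that overrides forward.
class Linear:
    def forward(self, x):
        return x

class CustomLinear(Linear):
    # Custom forward logic that a quantized replacement layer
    # would not reproduce.
    def forward(self, x):
        return super().forward(x) * 2

def is_quantizable(module):
    # Exact-type check: only plain Linear qualifies. Subclasses are
    # black-listed because their forward may differ.
    return type(module) is Linear

print(is_quantizable(Linear()))        # True
print(is_quantizable(CustomLinear()))  # False (isinstance would say True)
```

As the reviewer notes, this check can be too conservative: a subclass whose forward override is behaviorally harmless (e.g. only shuffles tensors between devices) is still excluded.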