Add ONNX rewriter by Giuseppe5 · Pull Request #82 · huggingface/optimum-amd

Giuseppe5 · 2024-02-22T10:18:52Z

Depends on #110

Giuseppe5 · 2024-03-19T13:43:31Z

+            # The number of Matmul+Gemm has to be less compared to the model pre-transformation
+            # This is not zero since there are matmul that are not linear layers so they are not replaced
+            # and some linears layers can be excluded from quantization
+            assert matmul_gemm_counter <= original_matmul_gemm_counter


A better test would check exactly how many MatMul and Gemm nodes we expect before and after the transformations, and similarly for MatMulInteger.

Considering all the other changes we are making, plus the new tests we will be adding, maybe we could defer this kind of test until everything is more stable.
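The exact-count check described in this comment could be sketched roughly as follows. This is only an illustration, not the test in this PR: `count_ops`, `fake_model`, and the node lists are hypothetical, and the stand-in model built from `SimpleNamespace` merely mimics the `graph.node` / `op_type` layout of an ONNX `ModelProto` so the snippet runs without `onnx` installed.

```python
from collections import Counter
from types import SimpleNamespace


def count_ops(model, op_types):
    """Count nodes per op type in an object shaped like an ONNX ModelProto."""
    counts = Counter(node.op_type for node in model.graph.node)
    # Report an explicit 0 for op types that do not appear in the graph.
    return {op: counts.get(op, 0) for op in op_types}


def fake_model(op_names):
    """Hypothetical stand-in exposing only graph.node[i].op_type."""
    nodes = [SimpleNamespace(op_type=name) for name in op_names]
    return SimpleNamespace(graph=SimpleNamespace(node=nodes))


# Made-up graphs: two linear layers are rewritten, one non-linear MatMul is kept.
before = fake_model(["MatMul", "Gemm", "MatMul", "Softmax"])
after = fake_model(["MatMulInteger", "MatMulInteger", "MatMul", "Softmax"])

tracked = ("MatMul", "Gemm", "MatMulInteger")
counts_before = count_ops(before, tracked)
counts_after = count_ops(after, tracked)
```

With a real model, `onnx.load(path)` returns an object with the same `graph.node` layout, so the same function could be applied to it; the expected counts per model would still have to be worked out by hand.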

Giuseppe5 · 2024-03-19T13:44:51Z

It seems that onnx_tool has bad type annotations, which is making the tests fail

mht-sharma · 2024-03-19T15:01:01Z

It seems that onnx_tool has bad type annotations, which is making the tests fail

@Giuseppe5, for the tests could you use python>=3.9? It should work with that.

fxmarty · 2024-03-20T05:40:50Z

Python 3.8 is still widely used (see https://pypistats.org/packages/transformers), although it reaches EOL in a few months. We therefore probably don't want python_requires=">=3.9" in setup.py, but we should raise a meaningful error if onnx_tool has issues with Python 3.8.
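One way to raise such an error, sketched under assumptions: the function name `ensure_supported_python`, the error type, and the message wording are all hypothetical, and the (3, 9) threshold is taken from the "python>=3.9" suggestion earlier in this thread rather than from onnx_tool's own documentation.

```python
import sys

# Assumption: threshold taken from the "python>=3.9" suggestion in this thread.
MIN_PYTHON = (3, 9)


def ensure_supported_python(version_info=None):
    """Raise a descriptive error when the interpreter is older than MIN_PYTHON."""
    version_info = sys.version_info if version_info is None else version_info
    if tuple(version_info[:2]) < MIN_PYTHON:
        found = ".".join(str(part) for part in version_info[:2])
        needed = ".".join(str(part) for part in MIN_PYTHON)
        raise RuntimeError(
            f"ONNX rewriting through onnx_tool is only supported on "
            f"Python >= {needed} (found Python {found})."
        )
```

Calling a guard like this at the point where onnx_tool is first needed keeps the package installable on Python 3.8 while failing early, with an explanation, for the one feature that is affected.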

fxmarty

Great work!

fxmarty

LGTM

mht-sharma

Thanks! LGTM

mht-sharma · 2024-03-21T09:07:41Z

+            # The number of Matmul+Gemm has to be less compared to the model pre-transformation
+            # This is not zero since there are matmul that are not linear layers so they are not replaced
+            # and some linears layers can be excluded from quantization
+            assert matmul_gemm_counter <= original_matmul_gemm_counter


Suggested change

-            assert matmul_gemm_counter <= original_matmul_gemm_counter
+            self.assertTrue(matmul_gemm_counter <= original_matmul_gemm_counter)

Giuseppe5 force-pushed the onnx_rewriter branch from 0c942ee to 21cd69b March 8, 2024 13:18

Giuseppe5 changed the base branch from brevitas-compatibility to main March 8, 2024 13:18

Giuseppe5 force-pushed the onnx_rewriter branch 2 times, most recently from 25f2f66 to 1c39f0a March 14, 2024 16:34

Giuseppe5 marked this pull request as ready for review March 14, 2024 16:36

Giuseppe5 mentioned this pull request Mar 15, 2024

Feat: export dq only #110

Merged

Giuseppe5 force-pushed the onnx_rewriter branch 2 times, most recently from ebfa390 to 2cf0db0 March 19, 2024 10:55

Giuseppe5 added 7 commits March 19, 2024 11:03

Add ONNX rewriter

bb4f843

New interface for rewriter

a2a4862

remove rewriter file

0c0cc97

Fixes

7b10fc3

Fix and formatting

f3c8800

Fix install

333663d

Formatting

37866e7

Giuseppe5 force-pushed the onnx_rewriter branch from 4225415 to 37866e7 March 19, 2024 11:04

Formatting

fd64a24

mht-sharma mentioned this pull request Mar 19, 2024

Add decoder modeling #108

Open

5 tasks

Giuseppe5 added 2 commits March 19, 2024 13:29

Adding tests for MatmulInteger

dc02967

Formatting

09e6cb7

Giuseppe5 requested a review from mht-sharma March 19, 2024 13:31

Giuseppe5 commented Mar 19, 2024

mht-sharma requested a review from fxmarty March 19, 2024 13:48

Bump python version to 3.9 for tests

c577f97

fxmarty reviewed Mar 20, 2024

Review

d8de4ef

Giuseppe5 requested a review from fxmarty March 20, 2024 10:03

Giuseppe5 added 2 commits March 20, 2024 11:28

Fix node names

21a05fa

Formatting

fac256f

Giuseppe5 force-pushed the onnx_rewriter branch from cc5b3b8 to fac256f March 20, 2024 11:33

fxmarty approved these changes Mar 21, 2024

mht-sharma approved these changes Mar 21, 2024

Change assert type

4e2523d

mht-sharma merged commit ca32e8e into huggingface:main Mar 22, 2024

mht-sharma merged 15 commits into huggingface:main from Giuseppe5:onnx_rewriter

3 participants
