-
Notifications
You must be signed in to change notification settings - Fork 18
Adds support for int8 w8a8_gemlite quantization #34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Changes from all commits
Commits
Show all changes
22 commits
Select commit
Hold shift + click to select a range
184349c
add torchao quantize_
anm-ol c1d0bfa
testing
anm-ol fd1e96b
testing yes
anm-ol 3101a1e
use taehv overide
anm-ol b7c015b
yuh
anm-ol 6710df9
add apply qat
anm-ol 4575508
yuh
anm-ol 40ad3f8
uh
anm-ol 1da5432
enable int4 benchmarking and inference
anm-ol b53ec0a
apply quantize_model w8a8
anm-ol 9a1ae6d
add int8 ptq
anm-ol 7838b05
quant none
anm-ol dbc4b73
int8 gemlite implementation
anm-ol 5fc5e73
clean up, remove torchao quantization
anm-ol 1418464
add gemlite to requirements
anm-ol 74f14f5
remove unused quant kernels and imports
anm-ol b545c08
restore gen_sample.py, more cleanup
anm-ol 945278d
update readme with Quantization docs
anm-ol e8a6e0f
fixed requirements gemlite
anm-ol ab2c74b
Clean up pyproject.toml and add config defaults to base_model.py
anm-ol 721218b
Add gemlite warmup+cache, and update gemlite version
anm-ol 470a05a
cleanup pyproject.toml and resize in examples
anm-ol File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Uh oh!
There was an error while loading. Please reload this page.