-
Notifications
You must be signed in to change notification settings - Fork 61
Add ONNX Sub Functions Export Feature for AutoModelForCausalLM #621
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Conversation
92f320c to
acec54f
Compare
|
Let's keep it uniform. Can we rename |
done. |
Please modify the PR commit message and desp. accordingly. thanks |
ochougul
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
review WIP.
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Fix for this JIRA from Imagine team Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
65d24bc to
6daa209
Compare
|
@abhishek-singh591 please rebase the PR |
vbaddi
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM, thanks 👍
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
ochougul
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Approving. Add todo for CustomOpTransform and merge once CI is passing.
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
) This PR introduces support for exporting ONNX modules as **functions**, enabling more efficient model compilation and execution on hardware. - Added a new flag **`use_onnx_subfunctions`** to control ONNX function export behavior. - Integrated ONNX function export capability into the inference pipeline. Set the flag before running inference (either during export or compile): ```bash model.export(tmp_path, use_onnx_subfunctions=True) ``` This feature is **opt-in** and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled. --------- Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com> Signed-off-by: Ann Kuruvilla <akuruvil@qti.qualcomm.com> Co-authored-by: quic-akuruvil <quic_akuruvil@quicinc.com> Signed-off-by: abhishek-singh591 <sabhis@qti.qualcomm.com>
ONNX Functions Export Support
Overview
This PR introduces support for exporting ONNX modules as functions, enabling more efficient model compilation and execution on hardware.
Key Changes
use_onnx_subfunctionsto control ONNX function export behavior.How to Enable ONNX Function Export
Set the flag before running inference (either during export or compile):
Backward Compatibility
This feature is opt-in and requires an explicit environment variable. Existing workflows remain unaffected when the flag is disabled.