Conversation
Signed-off-by: Xin He <xin3.he@intel.com>
for more information, see https://pre-commit.ci
|
|
Thank you for the checking. @XuehaoSun |
Signed-off-by: Xin He <xin3.he@intel.com>
|
It's more complex than the original expectation. since it's an omni model, more time is needed to enable it. |
|
|
Signed-off-by: Xin He <xin3.he@intel.com>
|
@XuehaoSun Please run with |
There was a problem hiding this comment.
Pull request overview
Adds support for the meituan-longcat/LongCat-Next MLLM family by introducing a dedicated longcat_next processor/template and small loader behavior tweaks to avoid chat-template related failures.
Changes:
- Register a new
longcat_nextMLLM template and processor. - Adjust chat-template handling to avoid calling
apply_chat_templatewhen no template is present. - Add LongCat-specific tokenizer loading behavior (
fix_mistral_regex) and additional calibration/block-selection tweaks.
Reviewed changes
Copilot reviewed 5 out of 5 changed files in this pull request and generated 4 comments.
Show a summary per file
| File | Description |
|---|---|
| auto_round/utils/model.py | Adds LongCat-related tokenizer loading flag and (per diff) updates multimodal block discovery behavior. |
| auto_round/envs.py | Adds a new environment toggle for calibration device behavior. |
| auto_round/compressors/mllm/template.py | Registers the new longcat_next template. |
| auto_round/compressors/mllm/processor.py | Adds LongCatNextProcessor and tightens chat-template checks. |
| auto_round/compressors/base.py | Adds (per diff) an env-controlled override for GPU vs CPU calibration caching behavior. |
| _register_template("mistral3", default_dataset="NeelNanda/pile-10k", processor=PROCESSORS["hf"]) | ||
| _register_template("mistral3_2", default_dataset="liuhaotian/llava", processor=PROCESSORS["mistral3_2"]) | ||
| _register_template("gemma3", default_dataset="NeelNanda/pile-10k", processor=PROCESSORS["hf"]) | ||
| _register_template("longcat_next", default_dataset="NeelNanda/pile-10k", processor=PROCESSORS["longcat_next"]) | ||
|
|
There was a problem hiding this comment.
A new model_type/template ('longcat_next') is registered, but there are existing unit tests that assert processor/template registration for other omni models (e.g., test/test_cpu/models/test_omni_model.py). Add analogous assertions for 'longcat_next' so regressions in registration or imports are caught.
Signed-off-by: Xin He <xin3.he@intel.com>
Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
Description
ValueError: Cannot use apply_chat_template because this processor does not have a chat template.
To reproduce:
auto-round /storage/xinhe/meituan-longcat/LongCat-Next/Type of Change
Related Issues
Fixes or relates to #
Checklist Before Submitting