Skip to content

Conversation

@mrariden
Copy link
Collaborator

@mrariden mrariden commented Nov 10, 2025

Check the state_dict after loading to verify that it's not a CP3 model. This will need to be revised if we ever implement concurrent support for CP3 and CP4

Passing tests on:

  • GH partial tests
  • Ubuntu full tests
  • mac partial tests

@mrariden
Copy link
Collaborator Author

resolves #1300

@mrariden
Copy link
Collaborator Author

Now the transformer will check the state_dict during model loading for the W2 parameter that reshapes token into pixel space. If that's not found, a ValueError is raised.

@mrariden mrariden merged commit 7a0f2c4 into main Nov 11, 2025
6 checks passed
@junliuchengxia
Copy link

winddows all fali

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants