Skip to content

Conversation

@clairesonglee
Copy link
Contributor

  • Maximize batch sizes for DeepSeek V3 on MI355X
  • Separate DeepSeek V3 16B configs to BF16 and FP8 precision configs

@Xiaoming-AMD Xiaoming-AMD merged commit ddf008c into main Jan 7, 2026
2 checks passed
@Xiaoming-AMD Xiaoming-AMD deleted the clairlee/torchtitan/optimize-dsv2-mi355-batch-sizes branch January 7, 2026 08:23
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants