Skip to content

[AMD] Add Minimax tp8 with ep and update vllm image for MI355x#868

Open
benenzhu wants to merge 5 commits intoSemiAnalysisAI:mainfrom
benenzhu:minimax-mi355-opt
Open

[AMD] Add Minimax tp8 with ep and update vllm image for MI355x#868
benenzhu wants to merge 5 commits intoSemiAnalysisAI:mainfrom
benenzhu:minimax-mi355-opt

Conversation

@benenzhu
Copy link

@benenzhu benenzhu commented Mar 5, 2026

Add tp8 with ep for conc 32 - 256 for Minimax in mi355x.

benenzhu and others added 3 commits March 11, 2026 23:51
Made-with: Cursor

# Conflicts:
#	benchmarks/single_node/minimaxm2.5_fp8_mi355x.sh
#	perf-changelog.yaml
@benenzhu benenzhu reopened this Mar 21, 2026
@benenzhu benenzhu changed the title [AMD] Add Minimax tp8 with ep and remove tp4 for MI355x [AMD] Add Minimax tp8 with ep and update vllm image for MI355x Mar 21, 2026
@benenzhu
Copy link
Author

@chunfangamd May you help enable sweep on this? We can use the new vllm image now.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

Development

Successfully merging this pull request may close these issues.

1 participant