
[plugin][oot] Add Kimi-K2.5 support #401

Open
gbyu-amd wants to merge 9 commits into main from guanbao/oot_kimi2.5

Conversation

@gbyu-amd (Contributor)

Motivation

This PR adds support for Kimi-K2.5-MXFP4 via the vLLM oot (out-of-tree) path. Functionality and accuracy tests pass. The recipe is provided as well.

Technical Details

Test Plan

Test Result

Submission Checklist


```python
def load_weights(self, weights: Iterable[tuple[str, torch.Tensor]]) -> set[str]:
    # load weights in plugin mode and discard passed weights generator
    # here prefix is "model." because Qwen3ForCausalLM is constructed in model
```
Contributor:

is this comment here right?

Contributor Author:

Corrected!
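The thread above concerns the `model.` prefix that checkpoint weight names must carry when the inner `*ForCausalLM` module is built one level down in the wrapper. A minimal sketch of that remapping step (this is an illustration, not the PR's actual implementation; the helper name and the use of plain objects in place of `torch.Tensor` are assumptions):

```python
# Hypothetical helper illustrating the prefix handling discussed above:
# when the causal-LM module is constructed under a `model.` scope,
# checkpoint names without that prefix must be remapped before they can
# be matched against the module's own parameter names.
from collections.abc import Iterable


def remap_with_prefix(
    weights: Iterable[tuple[str, object]], prefix: str = "model."
) -> list[tuple[str, object]]:
    """Prepend `prefix` to checkpoint names that lack it.

    `object` stands in for `torch.Tensor`; only the names matter here.
    """
    remapped = []
    for name, tensor in weights:
        if not name.startswith(prefix):
            name = prefix + name
        remapped.append((name, tensor))
    return remapped
```

In plugin mode the generator passed in by the engine would be discarded and weights re-read from disk instead, as the code comment in the diff notes.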

@wuhuikx wuhuikx requested a review from ganyi1996ppo March 25, 2026 07:07
@wuhuikx wuhuikx requested a review from ZhangLirong-amd March 25, 2026 07:25

The ATOM vLLM plugin backend keeps the standard vLLM CLI, server APIs, and general usage flow compatible with upstream vLLM. For general server options and API usage, refer to the [official vLLM documentation](https://docs.vllm.ai/en/latest/).
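Since the server keeps the standard OpenAI-compatible API, a client can talk to it with nothing but the standard library. A hedged sketch (the host, port, and model name below are placeholders, not values from this PR):

```python
# Sketch of querying the OpenAI-compatible endpoint a vLLM server
# exposes. Base URL and model name are assumptions for illustration.
import json
import urllib.request


def build_chat_request(model: str, prompt: str) -> dict:
    """Assemble a minimal /v1/chat/completions payload."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "max_tokens": 64,
    }


def send(payload: dict, base_url: str = "http://localhost:8000") -> dict:
    """POST the payload to a running server and decode the JSON reply."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(payload).encode(),
        headers={"Content-Type": "application/json"},
    )
    with urllib.request.urlopen(req) as resp:  # requires a running server
        return json.loads(resp.read())
```

Refer to the vLLM documentation linked above for the full set of server options and request parameters.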

Contributor:

Do we need any env var here, like quick allreduce, to improve performance while keeping accuracy? If so, we can point it out so users can try it, while warning them about the accuracy risk.

Contributor Author:

Yes, I think we should put such specific env vars in our recipes. Maybe we can add them in another PR for all the recipes under atom_vllm; right now all the recipes just provide the basic launch command without perf-boost env vars.
