
Conversation

@jinminxi104
Collaborator

No description provided.

Copilot AI and others added 11 commits on December 27, 2025 at 07:44
Co-authored-by: jinminxi104 <18713681+jinminxi104@users.noreply.github.com>

  • …ble names
  • Removed output assignment from runner.capture call.
  • Removed make_output_buffers and get_outputs_cudagraph methods from CudaGraphMixin.

Copilot AI left a comment

Pull request overview

This PR adds support for "all routed experts" by refactoring CUDA graph output handling in the Ascend backend. The changes introduce a new warmup phase during graph capture and delegate output buffer management to new model methods.

Key changes:

  • Added a warmup execution before graph capture to create the output buffers
  • Refactored output handling to use the make_output_buffers() and get_outputs_cudagraph() methods (see the sketch after this list)
  • Moved the capturing flag management inside the graph capture context
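For context, here is a minimal Python sketch of the flow those bullets describe, not the actual Ascend backend code: only make_output_buffers() and get_outputs_cudagraph() are names taken from this PR, while AscendGraphRunner, the capturing attribute, and the use of torch.cuda.graph (the Ascend backend has its own capture mechanism) are illustrative assumptions.

```python
import torch


class AscendGraphRunner:
    """Hypothetical runner illustrating the capture flow described above."""

    def __init__(self, model):
        self.model = model
        self.graph = torch.cuda.CUDAGraph()
        self.output_buffers = None

    def capture(self, **kwargs):
        # Warmup run before capture: the model allocates whatever outputs it
        # produces, and then owns the buffer layout via make_output_buffers().
        warmup_output = self.model(**kwargs)
        self.output_buffers = self.model.make_output_buffers(warmup_output)

        # The capturing flag is toggled inside the capture context, so the
        # model sees it only while the graph is actually being recorded.
        with torch.cuda.graph(self.graph):
            self.model.capturing = True
            self.model(**kwargs)
            self.model.capturing = False

        # Callers no longer assign an output from capture(); this sketch
        # assumes the model writes into the warmup-created buffers during the
        # captured run, so results are read back through those buffers.

    def forward(self, **kwargs):
        self.graph.replay()
        # The model decides which tensors to expose from its buffers.
        return self.model.get_outputs_cudagraph(self.output_buffers)
```

Letting the model build and expose its own output buffers keeps the runner agnostic about output shapes, which is presumably what allows an optional extra output such as the routed-experts tensor to survive graph replay.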


Co-authored-by: Copilot <175728472+Copilot@users.noreply.github.com>
