Skip to content

[GPUHeuristics] Fix RDNA4 conv heuristic seeds to resolve regressions#23841

Merged
yzhang93 merged 1 commit intoiree-org:mainfrom
yzhang93:rdna4-improve-conv-seeds
Mar 20, 2026
Merged

[GPUHeuristics] Fix RDNA4 conv heuristic seeds to resolve regressions#23841
yzhang93 merged 1 commit intoiree-org:mainfrom
yzhang93:rdna4-improve-conv-seeds

Conversation

@yzhang93
Copy link
Copy Markdown
Contributor

After the previous RDNA4 seed tuning commit, several medium/large-sized convolution shapes showed significant regressions. To fix the issue, increase conv subgroupCount seed from 4 to 8. This change will bring back the performance upto 2x faster, while not introduce new regressions.

Also update the convolution tests to use IGEMM path with use_igemm=true.

@yzhang93 yzhang93 force-pushed the rdna4-improve-conv-seeds branch 2 times, most recently from a35e384 to f5759b8 Compare March 19, 2026 18:08
Copy link
Copy Markdown
Contributor

@nirvedhmeshram nirvedhmeshram left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@yzhang93 yzhang93 enabled auto-merge (squash) March 19, 2026 19:18
After the previous RDNA4 seed tuning commit, several medium/large-sized
convolution shapes showed significant regressions. To fix the issue,
increase conv subgroupCount seed from 4 to 8. This change will bring back
the performance upto 2x faster, while not introduce new regressions.

Also update the convolution tests to use IGEMM path with use_igemm=true.

Signed-off-by: yzhang93 <zhyuhang88@gmail.com>
@yzhang93 yzhang93 force-pushed the rdna4-improve-conv-seeds branch from f5759b8 to a666b6b Compare March 20, 2026 04:41
@yzhang93 yzhang93 merged commit 5bc3ff0 into iree-org:main Mar 20, 2026
53 of 56 checks passed
@yzhang93 yzhang93 deleted the rdna4-improve-conv-seeds branch March 30, 2026 22:07
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants