Fix element type mismatch in attention preSoftmax fusion #2211
Merged
justinrosner merged 6 commits into develop on Jan 29, 2026
Conversation
Pull request overview
This PR fixes a crash in MIGraphX when compiling attention kernels with preSoftmax fusion. The issue occurred when lowering gridwise_attention_accel operations where the gemm0 output buffer element type was incorrectly set to the values input element type (elemTypeV), causing a type mismatch when the preSoftmax body's linalg.generic operation expected a different element type (e.g., when truncating or extending).
Changes:
- Modified the element type determination logic to walk the preSoftmax body and extract the correct type from the first linalg.generic operation's gemm0-based input (a sketch of this step follows below)
- Added a comprehensive LIT test that reproduces the original MIGraphX failure scenario with type conversions in the preSoftmax fusion body
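For illustration, here is a minimal C++ sketch of that type-walking step, assuming standard MLIR APIs. This is not the actual rocMLIR code: the helper name getGemm0OutElemType is made up, and it simplifies by reading the first input of the first linalg.generic, whereas the real fix identifies the gemm0-based input specifically.

```cpp
// Minimal sketch (not the actual rocMLIR code): derive the gemm0 output
// element type from the preSoftmax fusion region instead of defaulting
// to elemTypeV.
#include "mlir/Dialect/Linalg/IR/Linalg.h"
#include "mlir/IR/TypeUtilities.h"

using namespace mlir;

static Type getGemm0OutElemType(Region &preSoftmaxRegion, Type elemTypeV) {
  Type result = elemTypeV; // fall back to the pre-fix behaviour
  preSoftmaxRegion.walk([&](linalg::GenericOp generic) -> WalkResult {
    // Simplification: take the first input; the real fix matches the
    // gemm0-based operand among the generic's inputs.
    if (!generic.getInputs().empty())
      result = getElementTypeOrSelf(generic.getInputs().front());
    return WalkResult::interrupt(); // only the first generic matters
  });
  return result;
}
```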
Reviewed changes
Copilot reviewed 2 out of 2 changed files in this pull request and generated no comments.
| File | Description |
|---|---|
| mlir/lib/Dialect/Rock/Transforms/GridwiseGemmToBlockwise.cpp | Added logic to walk preSoftmax body and determine gemmOutElemType from the first generic's gemm0-based input, and fusionOutElemType from the last generic's output, fixing the element type mismatch |
| mlir/test/Dialect/Rock/gridwise-gemm-linalg-failure.mlir | New test file verifying correct handling of attention operations with preSoftmax fusion that performs f16 to f32 extension, ensuring the lowering produces correct buffer types |
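The same walking idea covers the second half of the GridwiseGemmToBlockwise.cpp row above. A companion sketch, under the same caveats (hypothetical helper name, simplified operand handling), could take fusionOutElemType from the output of the last linalg.generic in the region:

```cpp
// Companion sketch: the output element type of the *last* linalg.generic.
// Region::walk visits the ops in order here, so the final assignment wins.
static Type getFusionOutElemType(Region &preSoftmaxRegion, Type fallback) {
  Type result = fallback;
  preSoftmaxRegion.walk([&](linalg::GenericOp generic) {
    if (!generic.getOutputs().empty())
      result = getElementTypeOrSelf(generic.getOutputs().front());
  });
  return result;
}
```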
dhernandez0 reviewed Jan 28, 2026
pabloantoniom left a comment
I second both of Daniel's comments
pabloantoniom approved these changes Jan 29, 2026
Motivation
This PR fixes a crash that MIGraphX was seeing when compiling an attention kernel with fusion: https://amd-hub.atlassian.net/browse/AIROCMLIR-438
Technical Details
When lowering gridwise_attention_accel ops with preSoftmax fusion, the gemm0 output buffer element type was unconditionally set to elemTypeV (the values input element type). This caused a type mismatch when the preSoftmax body's linalg.generic expected a different element type for its gemm0-based input (e.g., when the linalg.generic was truncating or extending).
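To tie the pieces together, here is an equally hypothetical glue function that falls back to elemTypeV when there is no fusion body. It assumes the two sketches shown earlier and does not reflect the patch's actual structure.

```cpp
#include <utility>

// Hypothetical glue code combining the two sketches above; preSoftmaxRegion
// may be null when the attention op carries no fusion body.
static std::pair<Type, Type>
pickOutElemTypes(Region *preSoftmaxRegion, Type elemTypeV) {
  Type gemm0OutElemType = elemTypeV;  // pre-fix behaviour for both
  Type fusionOutElemType = elemTypeV;
  if (preSoftmaxRegion) {
    gemm0OutElemType = getGemm0OutElemType(*preSoftmaxRegion, elemTypeV);
    fusionOutElemType = getFusionOutElemType(*preSoftmaxRegion, elemTypeV);
  }
  // The gemm0 output buffer now matches what the first linalg.generic reads,
  // so an f16 -> f32 extending body no longer trips the type mismatch.
  return {gemm0OutElemType, fusionOutElemType};
}
```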
Test Plan
Test Result
Submission Checklist