[FEAT-DSL] Update FlyToROCDL Conversion to support full dsl types by sjfeng1999 · Pull Request #273 · ROCm/FlyDSL

sjfeng1999 · 2026-03-23T17:12:35Z

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Look over the contributing guidelines at https://github.com/ROCm/ROCm/blob/develop/CONTRIBUTING.md#pull-requests.

Copilot

Pull request overview

This PR updates the Fly→ROCDL lowering pipeline to better support the full Fly DSL type system (notably swizzles, composed layouts, and BufferDesc “fat pointer” handling), and aligns MLIR tests + Python DSL wrappers with the new printing/lowering behavior.

Changes:

Add BufferFatPtr helper and refactor BufferDesc pointer lowering/offsetting/copy to use it.
Extend layout lowering/type inference to handle SwizzleType in crd2idx and introduce fly.decomposition plumbing for copy expansion.
Update MLIR conversion/transform tests for formatting and i32-based GEP indices; reorganize/extend Python primitive DSL helpers.

Reviewed changes

Copilot reviewed 12 out of 15 changed files in this pull request and generated 9 comments.

Show a summary per file

File	Description
tests/mlir/Transforms/rewrite_func_signature.mlir	Adjusts expected type printing spacing in signatures.
tests/mlir/Transforms/layout_lowering.mlir	Adjusts expected ptr type printing spacing.
tests/mlir/Conversion/pointer_ops.mlir	Updates expected GEP index types/casts (i64 → i32) to match new lowering.
tests/mlir/Conversion/memref_ops.mlir	Updates expected GEP index types/casts (i64 → i32) to match new lowering.
tests/mlir/Conversion/dyn_shared.mlir	Updates expected dynamic shared GEP index type (i64 → i32).
python/flydsl/expr/primitive.py	Reorganizes and expands DSL op wrappers; adds/relocates ptr/memref helpers and “Deprecated” wrappers.
lib/Dialect/FlyROCDL/CDNA3/MmaAtom.cpp	Adjusts CDNA3 MFMA C thread/value layout construction.
lib/Dialect/Fly/Transforms/LayoutLowering.cpp	Adds `SwizzleType` handling in `crd2idx` lowering; inserts `fly.decomposition` before certain copy atom calls; improves GEMM shape handling.
lib/Dialect/Fly/IR/FlyTypeDefs.cpp	Improves printing spacing for types/attrs (commas).
lib/Dialect/Fly/IR/FlyOps.cpp	Extends `crd2idx` type inference for swizzles; adds `DecompositionOp` type inference; refines `MemRefLoadOp` inference for `CoordTensorType`.
lib/Dialect/Fly/IR/FlyAttrDefs.cpp	Adjusts parsing of leaf/basis attrs to more selectively parse `E<mode>` suffixes.
lib/Conversion/FlyToROCDL/FlyToROCDL.cpp	Refactors BufferDesc lowering through `BufferFatPtr`; adds ptr swizzle application at load/store/memcpy sites; simplifies iterator ops lowerings.
lib/Conversion/FlyToROCDL/BufferFatPtr.h	New helper implementing BufferDesc fat-pointer packing/unpacking and swizzled byte-offset computation.
include/flydsl/Dialect/Fly/Transforms/MemrefLowering.td	Adds pattern rewrites for `fly.decomposition` on memref/coord_tensor.
include/flydsl/Dialect/Fly/IR/FlyOps.td	Broadens `crd2idx` layout operand type (allows swizzle), adds `fly.decomposition` op, and tweaks assembly formats.

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

lib/Dialect/Fly/Transforms/LayoutLowering.cpp

lib/Conversion/FlyToROCDL/BufferFatPtr.h

lib/Conversion/FlyToROCDL/FlyToROCDL.cpp

lib/Dialect/Fly/Transforms/LayoutLowering.cpp

lib/Conversion/FlyToROCDL/BufferFatPtr.h

python/flydsl/expr/primitive.py

coderfeli

Review Comments

Critical

i32 GEP overflow: AddOffsetOpLowering uses i32 offsets for all address spaces including Global. Offsets >2GB will overflow. Should keep i64 for non-BufferDesc pointers.
Null deref in AddOffsetOpLowering: offset.getDefiningOp() returns nullptr for block arguments, then defOp->getOperand(0) crashes. Need a null check.
Buffer-to-buffer copy dropped: lowerCDNA3BufferCopy now rejects both-sides-BufferDesc and both-sides-non-BufferDesc. Old code handled these cases. Potential regression if any kernel uses buffer-to-buffer copy.
CDNA3 MmaAtom C layout: Removed ValM1 from accumulator layout. For mfma with M>16, this changes how accumulator values distribute. Needs correctness verification across all MFMA variants.

Medium

Swizzle logic duplicated between BufferFatPtr::swizzleByteOffset and applySwizzleOnPtr — should extract a shared helper.
lowerUniversalCopy memcpy length changed from i64 to i32. Confirm LLVM::MemcpyOp accepts i32 length.
No MLIR tests for fly.decomposition or swizzle in crd2idx.
MakeViewOpLowering relies on use_empty() for CoordTensor — fragile pattern, add a comment explaining why.

coderfeli · 2026-03-24T03:43:18Z

a little conflict @sjfeng1999 after I changed the crd2idx to use in gemm kernel.

sjfeng1999 · 2026-03-24T07:12:18Z

Review Comments

Critical

i32 GEP overflow: AddOffsetOpLowering uses i32 offsets for all address spaces including Global. Offsets >2GB will overflow. Should keep i64 for non-BufferDesc pointers.

Null deref in AddOffsetOpLowering: offset.getDefiningOp() returns nullptr for block arguments, then defOp->getOperand(0) crashes. Need a null check.

Buffer-to-buffer copy dropped: lowerCDNA3BufferCopy now rejects both-sides-BufferDesc and both-sides-non-BufferDesc. Old code handled these cases. Potential regression if any kernel uses buffer-to-buffer copy.

CDNA3 MmaAtom C layout: Removed ValM1 from accumulator layout. For mfma with M>16, this changes how accumulator values distribute. Needs correctness verification across all MFMA variants.

Medium

Swizzle logic duplicated between BufferFatPtr::swizzleByteOffset and applySwizzleOnPtr — should extract a shared helper.

lowerUniversalCopy memcpy length changed from i64 to i32. Confirm LLVM::MemcpyOp accepts i32 length.

No MLIR tests for fly.decomposition or swizzle in crd2idx.

MakeViewOpLowering relies on use_empty() for CoordTensor — fragile pattern, add a comment explaining why.

The type of offset in the GEP is coherent with the integer width in the int_tuple offset. It's not always i32.
After rewrite-func-signature pass, int_tuple value can't appear at the block_arguemnts.
It's unproper case anyway. CI will check if there is a regression.
Done, it's a true problem.

Made-with: Cursor

coderfeli · 2026-03-24T15:02:44Z

@sjfeng1999 rebased on main and fixed a minor run error in the moe and gemm. take a look.

sjfeng1999 requested review from coderfeli and Copilot March 23, 2026 17:12

Copilot started reviewing on behalf of sjfeng1999 March 23, 2026 17:14 View session

Copilot AI reviewed Mar 23, 2026

View reviewed changes

coderfeli reviewed Mar 24, 2026

View reviewed changes

sjfeng1999 and others added 7 commits March 24, 2026 13:45

Add BufferFatPtr class and update rocdl conversion

6838cbf

Made-with: Cursor

Add DecompositionOp and support swizzle as the mapping of crd2idx

927bd8c

minor fix

8d6ee97

remove redundant getResult()

58eafcb

fix MFMA 32x32 layoutC

fb9de4c

add missing get_dyn_shared

1ac9971

fix gemm and moe run

a226da8

coderfeli force-pushed the pr/yodate-to-rocdl-conversion branch from f2f651c to a226da8 Compare March 24, 2026 15:01

Merge branch 'main' into pr/yodate-to-rocdl-conversion

74aa17a

coderfeli merged commit cf3ab62 into main Mar 25, 2026
8 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[FEAT-DSL] Update FlyToROCDL Conversion to support full dsl types#273

[FEAT-DSL] Update FlyToROCDL Conversion to support full dsl types#273
coderfeli merged 8 commits intomainfrom
pr/yodate-to-rocdl-conversion

sjfeng1999 commented Mar 23, 2026

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderfeli left a comment

Uh oh!

coderfeli commented Mar 24, 2026

Uh oh!

sjfeng1999 commented Mar 24, 2026

Review Comments

Critical

Medium

Uh oh!

coderfeli commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

sjfeng1999 commented Mar 23, 2026

Motivation

Technical Details

Test Plan

Test Result

Submission Checklist

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

coderfeli left a comment

Choose a reason for hiding this comment

Review Comments

Critical

Medium

Uh oh!

coderfeli commented Mar 24, 2026

Uh oh!

sjfeng1999 commented Mar 24, 2026

Review Comments

Critical

Medium

Uh oh!

coderfeli commented Mar 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants