Skip to content

[wip] plumb VDMFMA#23829

Draft
efric wants to merge 2 commits intousers/efric/VDMFMA-2from
users/efric/VDMFMA-3
Draft

[wip] plumb VDMFMA#23829
efric wants to merge 2 commits intousers/efric/VDMFMA-2from
users/efric/VDMFMA-3

Conversation

@efric
Copy link
Copy Markdown
Member

@efric efric commented Mar 18, 2026

ci-extra: test_torch

efric added 2 commits March 18, 2026 01:25
Wire VDMFMA virtual intrinsics into the kernel configuration pipeline
so they are automatically selected for skinny GEMMs (M<=8). Add e2e
tests verifying correctness on ROCM hardware.

Key changes:
- IREEGPUAttrs.cpp: Extend getVirtualIntrinsics to return VDMFMA
  intrinsics from compatible 16x16 MFMA base intrinsics
- ConfigUtils.cpp: Derive VDMFMA candidates in getMmaScheduleFromProblem
  with M-size guard to prevent selection for large M
- GPUPackToIntrinsics.cpp: Add MmaInterfaceAttr fallback for VirtualMMA
  in getPackedSizes
- config_tile_and_fuse.mlir: Add skinny GEMM config selection tests
- skinny_gemm_vdmfma.mlir: E2E ROCM correctness tests

Signed-off-by: Eric Feng <Eric.Feng@amd.com>
Signed-off-by: Eric Feng <Eric.Feng@amd.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant