Is this a new feature, an improvement, or a change to existing functionality?
New Feature
How would you describe the priority of this feature request?
High
Please provide a clear description of problem this feature solves
Integrating NCCL with CuTile-Python enables compute-communication operator fusion.
Feature Description
Are there plans to integrate NCCL as the communication backend? Thx.
Describe your ideal solution
def allgather_gemm():
nccl_device.allgather()
gemm()
Describe any alternatives you have considered
No response
Additional context
No response
Contributing Guidelines