This PR adds a very minimal (and sub-optimal with respect to input slicing) sharding implementation for EXLA based on Shardy.
The goal is for us to discuss whether we want these as Nx callbacks, or whether we want to add a way for EXLA to declare its own defn symbols for all_reduce/all_gather/all_to_all and related operations.
My biggest concern with exposing this to Nx core is that, up until now, Nx core has no concept of devices, while XLA's sharding (and PyTorch's as well, as far as @Chapaman and I researched) is tightly coupled to devices.
We could very well get away with EXLA providing deftransforms that introduce :metadata Defn.Expr nodes, annotating the expression for EXLA.Defn to turn into EXLA.MLIR.Value calls (rough sketch below).
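To make that last point concrete, here is a minimal sketch of what such a deftransform could look like. The module name, the :exla_collective metadata key, and its payload are invented purely for illustration and are not part of this PR; only deftransform and Nx.Defn.Expr.metadata/2 are existing Nx APIs.

```elixir
# Illustrative only: an EXLA-provided deftransform that tags an expression
# with a :metadata Defn.Expr node, which EXLA.Defn could later lower into an
# EXLA.MLIR.Value collective call. Module name and metadata key are made up.
defmodule EXLA.Defn.Sharding do
  import Nx.Defn

  # Wraps the expression in a metadata node. During compilation, EXLA.Defn
  # would pattern-match on the :exla_collective key and emit the corresponding
  # collective op; other compilers could simply ignore the annotation.
  deftransform all_reduce(expr, opts) do
    Nx.Defn.Expr.metadata(expr, %{exla_collective: {:all_reduce, opts}})
  end
end
```

User code inside a defn could then pipe into `EXLA.Defn.Sharding.all_reduce(expr, op: :sum)` (again, a hypothetical call), keeping the device-aware pieces entirely on the EXLA side instead of in Nx core.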