-
Notifications
You must be signed in to change notification settings - Fork 114
Pull requests: meta-pytorch/monarch
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
: wait for alloc completion in ProcMesh::stop()
ciflow/rocm
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
module: rocm
#2098
opened Dec 10, 2025 by
shayne-fletcher
Loading…
add monarch serve torchx command to launch the (MAST) job and cache the command inside the jobs .pkl
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2097
opened Dec 9, 2025 by
colin2328
Loading…
Fix otel teardown bug during unit tests
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2095
opened Dec 9, 2025 by
vidhyav
Loading…
Temporary Commit at 12/8/2025, 1:59:52 PM
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2093
opened Dec 9, 2025 by
samlurye
Loading…
Load nccl dynamically
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2088
opened Dec 8, 2025 by
zdevito
Loading…
Link statically
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2085
opened Dec 8, 2025 by
zdevito
Loading…
Change ProcMeshAgent timeout from Stopped to Failed
ciflow/rocm
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
module: rocm
#2081
opened Dec 6, 2025 by
dulinriley
Loading…
[monarch] Introduce pytest marker to isolate tests in subprocess
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2076
opened Dec 5, 2025 by
samlurye
Loading…
[ROCM] Hipify Monarch
CLA Signed
This label is managed by the Meta Open Source bot.
module: rocm
#2073
opened Dec 5, 2025 by
zstreet87
Loading…
: remove controller health monitoring
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2058
opened Dec 5, 2025 by
shayne-fletcher
Loading…
break dep on hyperactor_multiprocess
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2052
opened Dec 4, 2025 by
shayne-fletcher
Loading…
: remomve dependence on hyperactor_multiprocess
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2048
opened Dec 4, 2025 by
shayne-fletcher
Loading…
Quick fix for T246995730
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2031
opened Dec 3, 2025 by
thomasywang
Loading…
Add ARM workflow for GB200 support
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2029
opened Dec 2, 2025 by
allenwang28
Loading…
[Builds] Add separate Rust build features for core, rdma and tensor_engine
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2020
opened Dec 1, 2025 by
allenwang28
Loading…
Replace regex Captures::get(0).unwrap() with get_match()
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#2014
opened Nov 27, 2025 by
ship-it-ship-it
Loading…
[ROCm][CI] First draft of ROCm build workflow
ciflow/rocm
CLA Signed
This label is managed by the Meta Open Source bot.
module: rocm
#2001
opened Nov 26, 2025 by
jithunnair-amd
•
Draft
Remove torch-op special call path in tensor engine
CLA Signed
This label is managed by the Meta Open Source bot.
#1986
opened Nov 24, 2025 by
zdevito
Loading…
Fix CQS signal F811 in fbcode/monarch/python/monarch/controller
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#1983
opened Nov 24, 2025 by
facebook-github-bot
Loading…
Set LocalAlloc's transport to be Local
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#1981
opened Nov 24, 2025 by
pzhan9
Loading…
fbcode/monarch/docs/source/examples/distributed_tensors.py
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#1980
opened Nov 24, 2025 by
facebook-github-bot
Loading…
fbcode/monarch/docs/source/examples/distributed_tensors.py
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#1979
opened Nov 24, 2025 by
pzhan9
Loading…
Test against stable pytorch instead of nightly
CLA Signed
This label is managed by the Meta Open Source bot.
fb-exported
meta-exported
#1973
opened Nov 21, 2025 by
dulinriley
Loading…
Bug in telemetry doesn't collect metrics
CLA Signed
This label is managed by the Meta Open Source bot.
#1964
opened Nov 21, 2025 by
vidhyav
Loading…
Previous Next
ProTip!
Follow long discussions with comments:>50.