Pull requests: pytorch/ao

Pull requests list

hook up real nvfp4 grouped_gemm
Labels: CLA Signed (managed by the Facebook bot; authors need to sign the CLA before a PR can be reviewed), module: not user facing (use this tag if you don't want this PR to show up in release notes)
#4316 opened Apr 22, 2026 by vkuzo (Contributor)
make NVFP4Tensor handle per-expert outer scale
Labels: CLA Signed, module: not user facing
#4315 opened Apr 22, 2026 by vkuzo (Contributor)
emulated nvfp4 support torch._grouped_mm for inference
Labels: CLA Signed, module: not user facing
#4314 opened Apr 22, 2026 by vkuzo (Contributor)
gptq example: remove transformers version check
Labels: CLA Signed, module: not user facing
#4313 opened Apr 22, 2026 by vkuzo (Contributor)
add gptq benchmark, and speed up by ~3x with compile
Labels: CLA Signed, module: not user facing
#4310 opened Apr 21, 2026 by vkuzo (Contributor)
[X86] Re-enable and improve some test cases
Labels: ciflow/rocm, CLA Signed, cpu, module: not user facing
#4308 opened Apr 21, 2026 by Xia-Weiwen (Collaborator)
[PT2E] Run weight observer eagerly for dynamic quant
Labels: CLA Signed, module: not user facing, module: pt2e_quant (pt2 export quantization: prepare_pt2e, convert_pt2e, quantizer)
#4307 opened Apr 21, 2026 by Xia-Weiwen (Collaborator), Draft
[test][xpu] Adjust the test sequence for Intel GPU CI
Labels: ciflow/xpu (label used to trigger xpu CI jobs), CLA Signed, module: not user facing, topic: for developers (use this tag if this PR is mainly developer facing), xpu (Intel XPU related features)
#4298 opened Apr 19, 2026 by zxd1997066 (Contributor)
fix(utils): propagate non_blocking in TorchAOBaseTensor._to_copy and _get_to_kwargs
Labels: CLA Signed
#4297 opened Apr 19, 2026 by Dev-next-gen
add FSDP and TP tests for Float8BlockwiseLinear
Labels: CLA Signed, module: training (quantize_ api training flow)
#4295 opened Apr 17, 2026 by iamzainhuda (Contributor)
Move collect_producer_nodes to graph_utils.py
Labels: CLA Signed
#4294 opened Apr 17, 2026 by tom-arm (Contributor)
[mxfp8 moe training] fuse dynamic per-group padding into cutedsl 2d mxfp8 quant kernel
Labels: CLA Signed
#4293 opened Apr 17, 2026 by MagellaX (Contributor)
basic enablement for mxfp8 and mxfp4 inference on AMD MI350x
Labels: CLA Signed, module: inference (quantize_ api inference flow)
#4290 opened Apr 16, 2026 by vkuzo (Contributor)
[AARCH64] Enable MKLDNN Backend for Int8DynamicActivationInt8WeightConfig() on ARM
Labels: CLA Signed, module: inference
#4281 opened Apr 15, 2026 by agrawal-aka (Contributor)
Add Sparse2x4HIPSPARSELTFloat8Tensor (#4277)
Labels: CLA Signed, fb-exported, meta-exported, module: rocm
#4277 opened Apr 14, 2026 by bbeckca (Contributor)
[ROCm] Add MXFP8 training support for gfx950 (MI355X)
Labels: ciflow/rocm-mi300, CLA Signed, module: rocm
#4275 opened Apr 14, 2026 by indianspeedster, Draft, MXFP8 Training
[xpu][mx][test] Enable mx serialization tests on xpu
Labels: CLA Signed
#4272 opened Apr 13, 2026 by ugolowic (Contributor)
[xpu][mx] Fix NaN scale propagation in RCEIL triton kernel
Labels: CLA Signed
#4271 opened Apr 13, 2026 by ugolowic (Contributor)
[optim] Add GrokAdamW optimizer with low-bit quantization support
Labels: CLA Signed
#4270 opened Apr 13, 2026 by vaibhavhariram
Add torch.uint16, torch.uint32
Labels: CLA Signed
#4269 opened Apr 12, 2026 by Freed-Wu
Add reduce_range to avoid overflow in int8 tensor
Labels: CLA Signed, module: not user facing
#4266 opened Apr 10, 2026 by cyxlily (Contributor)
Support 32x32 scaling for weights in MXFP8 weight quantization kernel
Labels: CLA Signed, module: training, moe, mx
#4254 opened Apr 9, 2026 by alexsamardzic (Collaborator), MXFP8 Training
[nvfp4 training] add autograd support for NVFP4 emulated grouped GEMM
Labels: CLA Signed
#4252 opened Apr 9, 2026 by roycho96 (Contributor)