Pull requests: pytorch/ao
Pull requests list
hook up real nvfp4 grouped_gemm
CLA Signed
This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed.
module: not user facing
Use this tag if you don't want this PR to show up in release notes
#4316
opened Apr 22, 2026 by
vkuzo
Contributor
make NVFP4Tensor handle per-expert outer scale
CLA Signed
module: not user facing
#4315
opened Apr 22, 2026 by
vkuzo
Contributor
emulated nvfp4 support torch._grouped_mm for inference
CLA Signed
module: not user facing
#4314
opened Apr 22, 2026 by
vkuzo
Contributor
gptq example: remove transformers version check
CLA Signed
module: not user facing
#4313
opened Apr 22, 2026 by
vkuzo
Contributor
docs(finetuning): replace TorchTune QAT section with Unsloth
#4312
opened Apr 21, 2026 by
Anai-Guo
3 tasks
add gptq benchmark, and speed up by ~3x with compile
CLA Signed
module: not user facing
#4310
opened Apr 21, 2026 by
vkuzo
Contributor
[X86] Re-enable and improve some test cases
ciflow/rocm
CLA Signed
cpu
module: not user facing
#4308
opened Apr 21, 2026 by
Xia-Weiwen
Collaborator
[PT2E] Run weight observer eagerly for dynamic quant
CLA Signed
module: not user facing
module: pt2e_quant
pt2 export quantization (prepare_pt2e, convert_pt2e, quantizer)
#4307
opened Apr 21, 2026 by
Xia-Weiwen
Collaborator
Draft
docs(qat): update supported configs list, fix unclickable link, add blog ref
#4305
opened Apr 21, 2026 by
Anai-Guo
[test][xpu] Adjust the test sequence for Intel GPU CI
ciflow/xpu
label used to trigger xpu CI jobs
CLA Signed
module: not user facing
topic: for developers
Use this tag if this PR is mainly developer facing
xpu
Intel XPU related features
#4298
opened Apr 19, 2026 by
zxd1997066
Contributor
fix(utils): propagate non_blocking in TorchAOBaseTensor._to_copy and _get_to_kwargs
CLA Signed
#4297
opened Apr 19, 2026 by
Dev-next-gen
add FSDP and TP tests for Float8BlockwiseLinear
CLA Signed
module: training
quantize_ api training flow
#4295
opened Apr 17, 2026 by
iamzainhuda
Contributor
Move collect_producer_nodes to graph_utils.py
CLA Signed
#4294
opened Apr 17, 2026 by
tom-arm
Contributor
[mxfp8 moe training] fuse dynamic per-group padding into cutedsl 2d mxfp8 quant kernel
CLA Signed
#4293
opened Apr 17, 2026 by
MagellaX
Contributor
basic enablement for mxfp8 and mxfp4 inference on AMD MI350x
CLA Signed
module: inference
quantize_ api inference flow
#4290
opened Apr 16, 2026 by
vkuzo
Contributor
[AARCH64] Enable MKLDNN Backend for Int8DynamicActivationInt8WeightConfig() on ARM
CLA Signed
module: inference
#4281
opened Apr 15, 2026 by
agrawal-aka
Contributor
Add Sparse2x4HIPSPARSELTFloat8Tensor (#4277)
CLA Signed
fb-exported
meta-exported
module: rocm
#4277
opened Apr 14, 2026 by
bbeckca
Contributor
[ROCm] Add MXFP8 training support for gfx950 (MI355X)
ciflow/rocm-mi300
CLA Signed
module: rocm
[xpu][mx][test] Enable mx serialization tests on xpu
CLA Signed
#4272
opened Apr 13, 2026 by
ugolowic
Contributor
[xpu][mx] Fix NaN scale propagation in RCEIL triton kernel
CLA Signed
#4271
opened Apr 13, 2026 by
ugolowic
Contributor
[optim] Add GrokAdamW optimizer with low-bit quantization support
CLA Signed
#4270
opened Apr 13, 2026 by
vaibhavhariram
Add torch.uint16, torch.uint32
CLA Signed
#4269
opened Apr 12, 2026 by
Freed-Wu
Add reduce_range to avoid overflow in int8 tensor
CLA Signed
module: not user facing
#4266
opened Apr 10, 2026 by
cyxlily
Contributor
Support 32x32 scaling for weights in MXFP8 weight quantization kernel
CLA Signed
module: training
moe
mx
[nvfp4 training] add autograd support for NVFP4 emulated grouped GEMM
CLA Signed
#4252
opened Apr 9, 2026 by
roycho96
Contributor