Fix NVFP4 QAT convert path #3450

andrewor14 · 2025-12-05T20:26:13Z

Summary: The previous convert path through QATConfig did not swap NVFP4FakeQuantizedLinear back to torch.nn.Linear. The numerics tests still passed because this fake quantized linear happen to match the PTQ numerics exactly.

Test Plan:

python test/quantization/test_qat.py -k test_qat_nvfp4

**Summary:** The previous convert path through `QATConfig` did not swap `NVFP4FakeQuantizedLinear` back to `torch.nn.Linear`. The numerics tests still passed because this fake quantized linear happen to match the PTQ numerics exactly. **Test Plan:** ``` python test/quantization/test_qat.py -k test_qat_nvfp4 ```

pytorch-bot · 2025-12-05T20:26:16Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3450

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❗ 1 Active SEVs

There are 1 currently active SEVs. If your PR is affected, please view them below:

Replace all macOS instances with nextjs due CVE-2025-55182

✅ No Failures

As of commit 048568f with merge base aa21b80 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

meta-codesync · 2025-12-05T20:28:48Z

@andrewor14 has imported this pull request. If you are a Meta employee, you can view this in D88512245.

jerryzh168 · 2025-12-06T00:35:44Z

test/quantization/test_qat.py

        sqnr = compute_error(out, baseline_out).item()
        self.assertGreaterEqual(sqnr, float("inf"))

+        # Compare converted values


oh we didn't compare convert results before?

It's tested here:

ao/test/quantization/test_qat.py

Line 2085 in 048568f

def test_quantize_api_nvfp4(self, use_per_tensor_scale: bool):

We just never explicitly checked it's using tensor subclasses after convert (tests still passed because QAT prepare mimics PTQ exactly)

andrewor14 requested a review from vkuzo December 5, 2025 20:26

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Dec 5, 2025

andrewor14 requested a review from jerryzh168 December 5, 2025 20:26

andrewor14 added the topic: improvement Use this tag if this PR is an improvement (doesn't fit into any of the other categories) label Dec 5, 2025

jerryzh168 reviewed Dec 6, 2025

View reviewed changes

jerryzh168 approved these changes Dec 6, 2025

View reviewed changes

andrewor14 merged commit 51fd90e into main Dec 7, 2025
23 of 25 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Fix NVFP4 QAT convert path #3450

Fix NVFP4 QAT convert path #3450

Uh oh!

andrewor14 commented Dec 5, 2025

Uh oh!

pytorch-bot bot commented Dec 5, 2025 •

edited

Loading

Uh oh!

meta-codesync bot commented Dec 5, 2025

Uh oh!

jerryzh168 Dec 6, 2025

Uh oh!

andrewor14 Dec 7, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Fix NVFP4 QAT convert path #3450

Fix NVFP4 QAT convert path #3450

Uh oh!

Conversation

andrewor14 commented Dec 5, 2025

Uh oh!

pytorch-bot bot commented Dec 5, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3450

❗ 1 Active SEVs

✅ No Failures

Uh oh!

meta-codesync bot commented Dec 5, 2025

Uh oh!

jerryzh168 Dec 6, 2025

Choose a reason for hiding this comment

Uh oh!

andrewor14 Dec 7, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

pytorch-bot bot commented Dec 5, 2025 •

edited

Loading