Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata #3176

Xia-Weiwen · 2025-10-15T07:46:14Z

Summary
Use inv_scale on CPU to avoid division at runtime to improve performance. Used for SmoothQuant.

Test plan
pytest -sv test/prototype/test_smoothquant.py

pytorch-bot · 2025-10-15T07:46:18Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3176

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

✅ No Failures

As of commit c7e242c with merge base ff16308 ():
💚 Looks good so far! There are no failures yet. 💚

This comment was automatically generated by Dr. CI and updates every 15 minutes.

…data

Xia-Weiwen · 2025-10-20T06:56:36Z

Hi @jerryzh168 Can you please review this PR? Thanks.

jerryzh168

we are deprecating these utils though to_weight_tensor_with_linear_activation_scale_metadata

jerryzh168

we are deprecating these utils though to_weight_tensor_with_linear_activation_scale_metadata

Xia-Weiwen · 2025-10-22T08:12:02Z

we are deprecating these utils though to_weight_tensor_with_linear_activation_scale_metadata

Hi @jerryzh168 Thanks for reviewing. This PR modifies the to_weight_tensor_with_linear_activation_scale_metadata path, which is defined here:

ao/torchao/quantization/linear_activation_scale.py

Lines 118 to 120 in 7e68d5e

    
           to_weight_tensor_with_linear_activation_scale_metadata = ( 
        
               WeightTensorWithLinearActivationScaleMetadata.from_float 
        
           )

And this PR adds a new argument use_inv_scale to WeightTensorWithLinearActivationScaleMetadata.from_float:

ao/torchao/quantization/linear_activation_scale.py

Lines 73 to 78 in 7e68d5e

    
           def from_float( 
        
               cls, 
        
               input_float: torch.Tensor, 
        
               scale: torch.Tensor, 
        
           ): 
        
               return cls(input_float, scale)

and modifies the implementation of the WeightTensorWithLinearActivationScaleMetadata.

Can you please review again? Thanks.

meta-cla bot added the CLA Signed This label is managed by the Facebook bot. Authors need to sign the CLA before a PR can be reviewed. label Oct 15, 2025

Xia-Weiwen added the topic: not user facing Use this tag if you don't want this PR to show up in release notes label Oct 15, 2025

Optional: use inv_scale for WeightTensorWithLinearActivationScaleMeta…

c7e242c

…data

Xia-Weiwen marked this pull request as ready for review October 16, 2025 02:11

Xia-Weiwen requested a review from jerryzh168 October 16, 2025 02:12

jerryzh168 requested changes Oct 21, 2025

View reviewed changes

Xia-Weiwen requested a review from jerryzh168 October 22, 2025 08:12

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata #3176

Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata #3176

Uh oh!

Xia-Weiwen commented Oct 15, 2025

Uh oh!

pytorch-bot bot commented Oct 15, 2025 •

edited

Loading

Uh oh!

Xia-Weiwen commented Oct 20, 2025

Uh oh!

jerryzh168 left a comment

Uh oh!

jerryzh168 left a comment

Uh oh!

Xia-Weiwen commented Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata #3176

Are you sure you want to change the base?

Optional: use inv_scale for WeightTensorWithLinearActivationScaleMetadata #3176

Uh oh!

Conversation

Xia-Weiwen commented Oct 15, 2025

Uh oh!

pytorch-bot bot commented Oct 15, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/3176

✅ No Failures

Uh oh!

Xia-Weiwen commented Oct 20, 2025

Uh oh!

jerryzh168 left a comment

Choose a reason for hiding this comment

Uh oh!

jerryzh168 left a comment

Choose a reason for hiding this comment

Uh oh!

Xia-Weiwen commented Oct 22, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

pytorch-bot bot commented Oct 15, 2025 •

edited

Loading