ModelPruning callback - hard pruning #19347
              
                Unanswered
              
          
                  
                    
                      ilya-SX
                    
                  
                
                  asked this question in
                code help: CV
              
            Replies: 0 comments
  
    Sign up for free
    to join this conversation on GitHub.
    Already have an account?
    Sign in to comment
  
        
    
Uh oh!
There was an error while loading. Please reload this page.
-
Hi all,
I am performing a structural pruning using the pruning callback (
pytorch_lightning.callbacks.ModelPruning) and since it performs soft pruning only (replacing weights with zeros) I see almost no difference in model latency when converted to ONNX. It seems that to reduce model latency hard pruning (removing zero weights) should be performed.My questions are:
Thanks
Beta Was this translation helpful? Give feedback.
All reactions