NVIDIA Corporation
BAYESIAN OPTIMIZATION OF SPARSITY RATIOS IN MODEL COMPRESSION
Last updated:
Abstract:
One embodiment of a method includes determining, by a Bayesian optimizer, a first sparsity ratio associated with a limit on an accuracy loss caused by compressing a machine learning model. The method further includes selecting, by the Bayesian optimizer, a second sparsity ratio that optimizes a predefined objective function for the machine learning model within a search space bounded by the first sparsity ratio. The method further includes generating a compressed version of the machine learning model having the second sparsity ratio.
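The three steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: a plain grid search stands in for the Bayesian optimizer, magnitude pruning stands in for the unspecified compression step, and every name here (`find_sparsity_limit`, `select_sparsity`, `compress`, and the toy `accuracy_loss` and `objective` functions) is hypothetical.

```python
from typing import Callable, List


def grid(lo: float, hi: float, steps: int) -> List[float]:
    """Evenly spaced candidate sparsity ratios in [lo, hi]."""
    return [lo + (hi - lo) * i / steps for i in range(steps + 1)]


def find_sparsity_limit(accuracy_loss: Callable[[float], float],
                        max_loss: float, steps: int = 20) -> float:
    """Step 1: largest candidate ratio whose accuracy loss stays within max_loss.

    Assumption: grid search replaces the Bayesian optimizer.
    """
    feasible = [r for r in grid(0.0, 1.0, steps) if accuracy_loss(r) <= max_loss]
    return max(feasible) if feasible else 0.0


def select_sparsity(objective: Callable[[float], float],
                    limit: float, steps: int = 20) -> float:
    """Step 2: ratio maximizing the objective within [0, limit]."""
    return max(grid(0.0, limit, steps), key=objective)


def compress(weights: List[float], ratio: float) -> List[float]:
    """Step 3: zero out the smallest-magnitude fraction of the weights.

    Assumption: magnitude pruning; the abstract does not name a method.
    """
    k = int(len(weights) * ratio)
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    drop = set(order[:k])
    return [0.0 if i in drop else w for i, w in enumerate(weights)]


# Toy stand-ins for measurements that would come from evaluating a real model.
def accuracy_loss(r: float) -> float:
    return r ** 2          # hypothetical: loss grows with sparsity


def objective(r: float) -> float:
    return r - 2 * r * r   # hypothetical: speedup minus a quality penalty


limit = find_sparsity_limit(accuracy_loss, max_loss=0.25)
ratio = select_sparsity(objective, limit)
pruned = compress([0.9, -0.1, 0.4, 0.05, -0.7, 0.2, 0.3, -0.6], ratio)
```

In a closer approximation, both searches would be driven by a surrogate model and an acquisition function rather than an exhaustive grid, so that each costly evaluation of the compressed model informs the choice of the next candidate ratio.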
Status:
Application
Type:
Utility
Filing date:
7 Feb 2020
Issue date:
4 Mar 2021