Microsoft Corporation
MODEL COMPRESSION BY SPARSITY-INDUCING REGULARIZATION OPTIMIZATION
Last updated:
Abstract:
The performance of a neural network (NN) and/or deep neural network (DNN) can limited by the number of operations being performed as well as management of data among the various memory components of the NN/DNN. A sparsity-inducing regularization optimization process is performed on a machine learning model to generate a compressed machine learning model. A machine learning model is trained using a first set of training data. A sparsity-inducing regularization optimization process is executed on the machine learning model. Based on the sparsity-inducing regularization optimization process, a compressed machine learning model is received. The compressed machine learning model is executed to generate one or more outputs.
Utility
1 Jun 2020
16 Dec 2021