Alibaba Group Holding Limited
METHOD AND DEVICE FOR REDUCING A SIZE OF A NEURAL NETWORK MODEL
Last updated:
Abstract:
Methods and apparatus for reducing a size of a neural network model, the method including: compressing data of the neural network model; identifying structure information of a vector register, wherein the structure information includes a number of registers included in the vector register; comparing a number of elements in the compressed data with a first condition, wherein the first condition is determined based on the number of registers in the vector register; and in response to the number of elements satisfying the first condition, associating the compressed data with the vector register to enable loading the compressed data to the vector register.
Status:
Application
Type:
Utility
Filling date:
18 Feb 2020
Issue date:
19 Aug 2021