Microsoft Corporation
ACCELERATING PROCESSING BASED ON SPARSITY FOR NEURAL NETWORK HARDWARE PROCESSORS
Last updated:
Abstract:
Embodiments of the present disclosure include systems and methods for accelerating processing based on sparsity for neural network hardware processors. An input manager determines a pair of non-zero values from a pair of data streams in a plurality of pairs of data streams and retrieve the pair of non-zero values from the pair of data streams. A multiplier performs a multiplication operation on the pair of non-zero values and generate a product of the pair of non-zero values. An accumulator manager receives the product of the pair of non-zero values from the multiplier and sends the product of the pair of non-zero values to a corresponding accumulator in a plurality of accumulators.
Status:
Application
Type:
Utility
Filling date:
14 Jan 2021
Issue date:
14 Jul 2022