QUALCOMM Incorporated
DEPTH-FIRST DEEP CONVOLUTIONAL NEURAL NETWORK INFERENCE
Last updated:
Abstract:
A method performed by a computing device includes determining a partition for depth-first processing by a multi-layer artificial neural network (ANN) of the computing device. The computing device comprises a processor, on-chip memory, and off-chip memory. The partition is determined based on an amount of on-chip memory used by the partition, an available amount of on-chip memory, and a size of a write back to the off-chip memory. The method also includes processing, at the computing device via the multi-layer ANN, an input using the depth-first processing in accordance with the partition.
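The abstract names three inputs to the partitioning decision: the on-chip memory a partition uses, the on-chip memory available, and the size of the write back to off-chip memory. The sketch below illustrates one way those inputs could be combined. It is not the claimed method: the `Layer` fields, the function names, and the greedy rule (among the end points that fit on chip, prefer the one with the smallest write back) are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Layer:
    name: str
    on_chip_bytes: int  # hypothetical: on-chip buffer this layer needs when fused depth-first
    output_bytes: int   # hypothetical: write-back size if a partition ends at this layer


def choose_partition_end(layers: List[Layer], start: int, available_on_chip: int) -> int:
    """Return the exclusive end index of the partition that begins at `start`.

    Assumed rule: among end points whose cumulative on-chip usage fits in
    `available_on_chip`, pick the one with the smallest write back
    (ties go to the longer partition).
    """
    used = 0
    best_end = None
    best_write_back = None
    for i in range(start, len(layers)):
        used += layers[i].on_chip_bytes
        if used > available_on_chip:
            break  # adding this layer would overflow on-chip memory
        write_back = layers[i].output_bytes
        if best_write_back is None or write_back <= best_write_back:
            best_end, best_write_back = i + 1, write_back
    if best_end is None:
        # Even a single layer does not fit; fall back to a one-layer partition.
        best_end = start + 1
    return best_end


def partition_network(layers: List[Layer], available_on_chip: int) -> List[Tuple[int, int]]:
    """Split the network into consecutive (start, end) partitions, end exclusive."""
    partitions = []
    start = 0
    while start < len(layers):
        end = choose_partition_end(layers, start, available_on_chip)
        partitions.append((start, end))
        start = end
    return partitions


# Made-up layer sizes, in arbitrary units.
example_layers = [
    Layer("conv1", on_chip_bytes=40, output_bytes=100),
    Layer("conv2", on_chip_bytes=30, output_bytes=50),
    Layer("conv3", on_chip_bytes=30, output_bytes=80),
    Layer("conv4", on_chip_bytes=50, output_bytes=20),
]
example_partitions = partition_network(example_layers, available_on_chip=100)
```

With these made-up sizes the first partition stops after `conv2`. Adding `conv3` would still fit on chip, but ending there would mean a larger write back. The rule is greedy and looks at one partition at a time, so it is not guaranteed to minimise total off-chip traffic across the whole network.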
Status:
Grant
Type:
Utility
Filing date:
14 Dec 2020
Issue date:
17 Jun 2021