QUALCOMM Incorporated
DEPTH-FIRST DEEP CONVOLUTIONAL NEURAL NETWORK INFERENCE

Last updated:

Abstract:

A method performed by a computing device includes determining a partition for depth-first processing by a multi-layer artificial neural network (ANN) of the computing device. The computing device comprising a processor, on-chip memory, and off-chip memory. The first partition determined based on an amount of on-chip memory used by the first partition, an available amount of on-chip memory, and a size of a write back to the off-chip memory. The method also includes processing, at the device via the multi-layer ANN, an input, using the depth-first processing in accordance with the partition.

Status:
Grant
Type:

Utility

Filling date:

14 Dec 2020

Issue date:

17 Jun 2021