Apple Inc.
FAST DEEP LEARNING FULLY-CONNECTED COLUMN-MAJOR IMPLEMENTATION
Last updated:
Abstract:
This application relates to classifying information using a fully-connected layer of a convolutional neural network. A method for classifying information using a fully-connected layer of a convolutional neural network includes calculating a first partial output for a first block of elements by performing a dot product operation using a first row of elements of the first block of elements and a first weight block, where the first row of elements of the first block of elements corresponds to a first batch of elements. The method further includes generating a first output element using the first partial output for the first block of elements and at least one other partial output corresponding to the first batch of elements.
Utility
11 Sep 2019
12 Nov 2020