Apple Inc.
ALLOCATION OF MACHINE LEARNING TASKS INTO A SHARED CACHE
Abstract:
The subject technology receives code corresponding to a neural network (NN) model, the code including particular operations that are performed by the NN model. The subject technology determines, among the particular operations, a set of operations that are to be allocated to a cache of the electronic device that is to execute the NN model. The subject technology generates a set of cache indicators corresponding to the determined set of operations. The subject technology compiles the code and the generated set of cache indicators to provide a compiled binary for the NN model to execute on a target device.
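The steps in the abstract (select the operations to place in cache, generate cache indicators for them, then compile both together) can be illustrated with a minimal Python sketch. Everything here is an assumption made for illustration and is not taken from the application: the `Operation` type, the function names, the "output fits in the cache" selection rule, and the dictionary standing in for a compiled binary are all hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Operation:
    """Hypothetical description of one NN operation."""
    name: str
    output_bytes: int  # assumed size of the data the operation produces


def select_cached_ops(ops, cache_bytes):
    # Assumed policy: an operation is allocated to the cache when its
    # output fits within the cache capacity. The abstract does not
    # state the actual selection criteria.
    return [op for op in ops if op.output_bytes <= cache_bytes]


def make_cache_indicators(ops, cached):
    # One boolean indicator per operation: True means "allocate to cache".
    cached_names = {op.name for op in cached}
    return {op.name: op.name in cached_names for op in ops}


def compile_model(ops, cache_bytes):
    # Stand-in for compilation: bundle the operation list with its
    # cache indicators into a single artifact.
    cached = select_cached_ops(ops, cache_bytes)
    return {
        "operations": [op.name for op in ops],
        "cache_indicators": make_cache_indicators(ops, cached),
    }


ops = [
    Operation("conv1", 64 * 1024),
    Operation("conv2", 4 * 1024 * 1024),
    Operation("relu", 64 * 1024),
]
binary = compile_model(ops, cache_bytes=1024 * 1024)
```

With an assumed 1 MiB cache, `conv1` and `relu` are flagged for the cache and `conv2` is not. A real compiler would presumably weigh data reuse and operation ordering as well, which this sketch does not attempt.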
Status:
Application
Type:
Utility
Filing date:
14 Oct 2019
Issue date:
3 Dec 2020