NVIDIA Corporation
TECHNIQUES FOR MODIFYING AND TRAINING A NEURAL NETWORK
Last updated:
Abstract:
Apparatuses, systems, and techniques are described herein to speed up inferencing in a neural network by copying output from one layer of the neural network to another computing resource based on dependencies among layers in the network. In at least one embodiment, a processor comprising one or more circuits causes two or more subsequent layers of one or more neural networks to be performed on separate computing resources from a previous layer of the one or more neural networks.
Status:
Application
Type:
Utility
Filling date:
27 May 2020
Issue date:
2 Dec 2021