International Business Machines Corporation
Dynamically resizing minibatch in neural network execution
Abstract:
A minibatch in a neural network execution may be dynamically resized based on on-chip memory. For example, a size of the minibatch is configured such that the minibatch fits within on-chip memory. The size of the minibatch may be resized for a sequence of layers in the neural network execution. A next layer's execution can commence responsive to the resized minibatch being completed in a previous layer without having to wait for all of the minibatch to be completed in the previous layer.
Status:
Grant
Type:
Utility
Filing date:
25 Mar 2019
Issue date:
7 Jun 2022
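
Illustrative sketch:
The idea summarized in the abstract can be illustrated with a short Python sketch. This is not the patented implementation, only a minimal illustration under simplifying assumptions: the names `fit_minibatch_size` and `run_layers_in_chunks` are hypothetical, each layer is modeled as a plain function over a list of samples, and a single `bytes_per_sample` figure stands in for the real per-layer memory footprint. The minibatch is cut into chunks that fit in on-chip memory, and each chunk is carried through the whole sequence of layers before the next chunk starts, so a later layer does not wait for the full minibatch to finish the earlier layer.

```python
from typing import Callable, List, Tuple

# A layer is modeled as a function from a list of samples to a list of samples.
Layer = Callable[[List[float]], List[float]]


def fit_minibatch_size(minibatch_size: int, bytes_per_sample: int, on_chip_bytes: int) -> int:
    """Return the largest chunk size (at least 1) that fits in on-chip memory.

    Assumption: memory use grows linearly with the number of samples.
    """
    if bytes_per_sample <= 0:
        raise ValueError("bytes_per_sample must be positive")
    return max(1, min(minibatch_size, on_chip_bytes // bytes_per_sample))


def run_layers_in_chunks(
    minibatch: List[float],
    layers: List[Layer],
    bytes_per_sample: int,
    on_chip_bytes: int,
) -> Tuple[List[float], List[Tuple[int, int, int]]]:
    """Run a sequence of layers over a minibatch, one resized chunk at a time.

    Returns the outputs plus a schedule of (layer_index, start, end) entries
    recording the order in which layer/chunk pairs were executed.
    """
    chunk = fit_minibatch_size(len(minibatch), bytes_per_sample, on_chip_bytes)
    outputs: List[float] = []
    schedule: List[Tuple[int, int, int]] = []
    for start in range(0, len(minibatch), chunk):
        data = minibatch[start:start + chunk]
        end = start + len(data)
        # The chunk moves through every layer before the next chunk is loaded,
        # so layer i+1 starts as soon as this chunk has finished layer i.
        for index, layer in enumerate(layers):
            data = layer(data)
            schedule.append((index, start, end))
        outputs.extend(data)
    return outputs, schedule


if __name__ == "__main__":
    double = lambda xs: [2 * x for x in xs]
    add_one = lambda xs: [x + 1 for x in xs]
    result, order = run_layers_in_chunks([1, 2, 3, 4, 5], [double, add_one], 4, 8)
    print(result)
    print(order)
```

In the returned schedule, the second layer's entry for the first chunk appears before the first layer's entry for the second chunk, which is the ordering the abstract describes. A real accelerator would also recompute the chunk size per layer sequence, since activation sizes differ between layers; the sketch keeps one size for brevity.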