International Business Machines Corporation
Dynamically resizing minibatch in neural network execution

Last updated:

Abstract:

A minibatch in a neural network execution may be dynamically resized based on on-chip memory. For example, a size of the minibatch is configured such that the minibatch fits within on-chip memory. The size of the minibatch may be resized for a sequence of layers in the neural network execution. A next layer's execution can commence responsive to the resized minibatch being completed in a previous layer without having to wait for all of the minibatch to be completed in the previous layer.

Status:
Grant
Type:

Utility

Filling date:

25 Mar 2019

Issue date:

7 Jun 2022