Alibaba Group Holding Limited
Systems and methods for scheduling neural networks by varying batch sizes

Last updated:

Abstract:

The present disclosure relates to computer-implemented systems and methods for scheduling a neural network for execution. In one implementation, a system for scheduling a neural network for execution may include at least one memory storing instructions and at least one processor configured to execute the instructions to determine a profile for one or more applications co-scheduled with at least one neural network; determine a batch size for the at least one neural network based on the determined profile for the one or more applications; and scheduling the one or more applications and the at least one neural network based on the batch size.

Status:
Grant
Type:

Utility

Filling date:

5 Apr 2019

Issue date:

4 May 2021