Alibaba Group Holding Limited
Systems and methods for scheduling neural networks by varying batch sizes
Last updated:
Abstract:
The present disclosure relates to computer-implemented systems and methods for scheduling a neural network for execution. In one implementation, a system for scheduling a neural network for execution may include at least one memory storing instructions and at least one processor configured to execute the instructions to determine a profile for one or more applications co-scheduled with at least one neural network; determine a batch size for the at least one neural network based on the determined profile for the one or more applications; and scheduling the one or more applications and the at least one neural network based on the batch size.
Status:
Grant
Type:
Utility
Filling date:
5 Apr 2019
Issue date:
4 May 2021