Alibaba Group Holding Limited
SYSTEMS AND METHODS FOR SCHEDULING NEURAL NETWORKS BY VARYING BATCH SIZES
Last updated:
Abstract:
The present disclosure relates to computer-implemented systems and methods for scheduling a neural network for execution. In one implementation, a system for scheduling a neural network for execution may include at least one memory storing instructions and at least one processor configured to execute the instructions to determine a profile for one or more applications co-scheduled with at least one neural network; determine a batch size for the at least one neural network based on the determined profile for the one or more applications; and scheduling the one or more applications and the at least one neural network based on the batch size.
Status:
Application
Type:
Utility
Filling date:
5 Apr 2019
Issue date:
8 Oct 2020