VMware, Inc.
Interference-aware scheduling service for virtual GPU enabled systems
Last updated:
Abstract:
Disclosed are aspects of interference-aware virtual machine assignment for systems whose graphics processing units (GPUs) are virtual GPU (vGPU) enabled. In some examples, a plurality of workloads are executed alone and co-located with other workloads in a vGPU-enabled system to determine baseline parameters and measured interferences. A machine learning model is trained to predict interference based on the measured interferences and the baseline parameters. A workload is assigned to, and executed on, the particular GPU that has the minimum predicted interference with that workload, given the workloads currently assigned to that GPU.
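The assignment step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the `Workload` fields, the `toy_predictor` function, and the GPU names are hypothetical, and the trained machine learning model is replaced by a hand-written stand-in that simply scores overlapping resource demand. Only the final selection rule (pick the GPU with the lowest predicted interference given its current workloads) follows the abstract.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Sequence


@dataclass(frozen=True)
class Workload:
    """Hypothetical baseline parameters, measured while the workload runs alone."""
    name: str
    gpu_util: float     # assumed feature: fraction of GPU compute used alone
    mem_bw_util: float  # assumed feature: fraction of memory bandwidth used alone


# A predictor maps (new workload, workloads already on a GPU) to a score.
Predictor = Callable[[Workload, Sequence[Workload]], float]


def toy_predictor(new: Workload, resident: Sequence[Workload]) -> float:
    """Stand-in for a trained model (assumption, not from the source).

    Scores interference as the summed product of overlapping demands,
    so an empty GPU always scores 0.0.
    """
    return sum(
        new.gpu_util * w.gpu_util + new.mem_bw_util * w.mem_bw_util
        for w in resident
    )


def assign(new: Workload, gpus: Dict[str, List[Workload]], predict: Predictor) -> str:
    """Place `new` on the GPU with the minimum predicted interference."""
    best = min(gpus, key=lambda gpu_id: predict(new, gpus[gpu_id]))
    gpus[best].append(new)
    return best


# Usage: two GPUs, each already hosting one workload.
gpus = {
    "gpu0": [Workload("train", 0.9, 0.8)],
    "gpu1": [Workload("infer", 0.3, 0.2)],
}
chosen = assign(Workload("render", 0.6, 0.5), gpus, toy_predictor)
print(chosen)
```

In this example the lightly loaded GPU scores lower than the heavily loaded one, so the new workload lands there. In a real system, the predictor would be a model fitted to the measured co-location interferences and baseline parameters rather than a fixed formula.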
Status:
Grant
Type:
Utility
Filing date:
5 Jun 2019
Issue date:
7 Sep 2021