Meta Platforms, Inc.
DYNAMIC QUALITY OF SERVICE MANAGEMENT FOR DEEP LEARNING TRAINING COMMUNICATION

Last updated:

Abstract:

A processor analyzes a machine learning workload. Corresponding priority levels are assigned to identified data requests in the machine learning workload based on an associated data dependency delay performance impact. The assigned corresponding priority levels are indicated when providing the data requests to a memory controller. The memory controller sorts the received data requests into a plurality of different priority queues based on the indicated corresponding priority levels. The memory controller initiates the data requests from the different priority queues to memory in an order based on different qualities of service of the different priority queues.

Status:
Application
Type:

Utility

Filling date:

24 Mar 2020

Issue date:

30 Sep 2021