Meta Platforms, Inc.
DYNAMIC QUALITY OF SERVICE MANAGEMENT FOR DEEP LEARNING TRAINING COMMUNICATION
Last updated:
Abstract:
A processor analyzes a machine learning workload. Corresponding priority levels are assigned to identified data requests in the machine learning workload based on an associated data dependency delay performance impact. The assigned corresponding priority levels are indicated when providing the data requests to a memory controller. The memory controller sorts the received data requests into a plurality of different priority queues based on the indicated corresponding priority levels. The memory controller initiates the data requests from the different priority queues to memory in an order based on different qualities of service of the different priority queues.
Status:
Application
Type:
Utility
Filling date:
24 Mar 2020
Issue date:
30 Sep 2021