International Business Machines Corporation
HIERARCHICAL DECENTRALIZED DISTRIBUTED DEEP LEARNING TRAINING
Last updated:
Abstract:
Embodiments of a method are disclosed. The method includes performing a batch of decentralized deep learning training for a machine learning model in coordination with multiple local homogenous learners on a deep learning training compute node, and in coordination with multiple super learners on corresponding deep learning training compute nodes. The method also includes exchanging communications with the super learners in accordance with an asynchronous decentralized parallel stochastic gradient descent (ADPSGD) protocol. The communications are associated with the batch of deep learning training.
Status:
Application
Type:
Utility
Filling date:
22 Jul 2020
Issue date:
27 Jan 2022