Intel Corporation
Data parallelism and halo exchange for distributed machine learning

Last updated:

Abstract:

One embodiment provides for a method of transmitting data between multiple compute nodes of a distributed compute system, the method comprising multi-dimensionally partitioning data of a feature map across multiple nodes for distributed training of a convolutional neural network; performing a parallel convolution operation on the multiple partitions to train weight data of the neural network; and exchanging data between nodes to enable computation of halo regions, the halo regions having dependencies on data processed by a different node.
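The partition, exchange, and convolve steps named in the abstract can be illustrated with a small single-process simulation. This is only a sketch under stated assumptions, not the claimed implementation: it covers the forward convolution only (no weight update), uses a single-channel 2-D feature map, an odd square kernel of size 3 or larger, zero padding at the outer border, and equal-sized tiles. The names `conv2d_valid`, `exchange_halos`, and `distributed_conv` are hypothetical, and the dictionary copies stand in for what would be inter-node messages in a real distributed system.

```python
import numpy as np


def conv2d_valid(x, k):
    """Plain 'valid' 2-D cross-correlation (no padding)."""
    kh, kw = k.shape
    out_h = x.shape[0] - kh + 1
    out_w = x.shape[1] - kw + 1
    out = np.zeros((out_h, out_w))
    for i in range(out_h):
        for j in range(out_w):
            out[i, j] = np.sum(x[i:i + kh, j:j + kw] * k)
    return out


def exchange_halos(tiles, h):
    """Give every tile a border of width h filled from its neighbors.

    `tiles` maps (row, col) grid positions to equally sized arrays.
    Each copy below stands in for one message between two nodes.
    Borders with no neighbor stay zero (assumed zero padding).
    """
    padded = {}
    for (r, c), tile in tiles.items():
        th, tw = tile.shape
        buf = np.zeros((th + 2 * h, tw + 2 * h))
        buf[h:h + th, h:h + tw] = tile  # the node's own data
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                nb = tiles.get((r + dr, c + dc))
                if (dr, dc) == (0, 0) or nb is None:
                    continue
                # Part of the neighbor that borders this tile.
                src_r = {-1: slice(-h, None), 0: slice(None), 1: slice(0, h)}[dr]
                src_c = {-1: slice(-h, None), 0: slice(None), 1: slice(0, h)}[dc]
                # Where that part lands in the padded buffer.
                dst_r = {-1: slice(0, h), 0: slice(h, h + th), 1: slice(h + th, None)}[dr]
                dst_c = {-1: slice(0, h), 0: slice(h, h + tw), 1: slice(h + tw, None)}[dc]
                buf[dst_r, dst_c] = nb[src_r, src_c]
        padded[(r, c)] = buf
    return padded


def distributed_conv(feature_map, kernel, grid=(2, 2)):
    """Partition in two dimensions, exchange halos, convolve each tile."""
    rows, cols = grid
    H, W = feature_map.shape
    th, tw = H // rows, W // cols  # assumes H and W divide evenly
    h = kernel.shape[0] // 2       # halo width for an odd square kernel
    tiles = {
        (r, c): feature_map[r * th:(r + 1) * th, c * tw:(c + 1) * tw].copy()
        for r in range(rows) for c in range(cols)
    }
    padded = exchange_halos(tiles, h)
    out = np.empty((H, W))
    for (r, c), buf in padded.items():
        # Each "node" convolves only its own padded tile.
        out[r * th:(r + 1) * th, c * tw:(c + 1) * tw] = conv2d_valid(buf, kernel)
    return out
```

Under these assumptions, `distributed_conv(fm, k)` should match `conv2d_valid(np.pad(fm, k.shape[0] // 2), k)`, the same convolution computed on the unpartitioned feature map. The halo copies are what make the outputs near tile borders correct, since those outputs depend on data held by a different node.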

Status:

Grant

Type:

Utility

Filing date:

12 Jan 2018

Issue date:

28 Jun 2022