International Business Machines Corporation
Multi-GPU deep learning using CPUs

Last updated:

Abstract:

A computer-implemented method, computer program product, and computer processing system are provided for accelerating neural network data parallel training in multiple graphics processing units (GPUs) using at least one central processing unit (CPU). The method includes forming a set of chunks. Each of the chunks includes a respective group of neural network layers other than a last layer. The method further includes performing one or more chunk-wise synchronization operations during a backward phase of the neural network data parallel training, by each of the multiple GPUs and the at least one CPU.

Status:
Grant
Type:

Utility

Filling date:

15 Dec 2017

Issue date:

2 Nov 2021