Intel Corporation
COMPUTE-BASED SUBGRAPH PARTITIONING OF DEEP LEARNING MODELS FOR FRAMEWORK INTEGRATION

Last updated:

Abstract:

Systems, apparatuses and methods provide technology for efficient subgraph partitioning, including generating a first set of subgraphs based on supported nodes of a model graph, wherein the supported nodes have operators that are supported by a hardware backend device, evaluating a compute efficiency of each subgraph of the first set of subgraphs with respect to the hardware backend device and to a default CPU associated with a default runtime, and selecting, from the first set of subgraphs, a second set of subgraphs to be run on the hardware backend device based on the evaluated compute efficiency. The technology can include calculating a backend performance factor for each subgraph for the hardware backend device, calculating a default performance factor for each subgraph for the default CPU, and comparing, for each respective subgraph of the of the first set of subgraphs, the backend performance factor and the default performance factor.

Status:
Application
Type:

Utility

Filling date:

24 Jun 2021

Issue date:

14 Oct 2021