NVIDIA Corporation
TECHNIQUE FOR COMPUTATIONAL NESTED PARALLELISM

Last updated:

Abstract:

One embodiment of the present invention sets forth a technique for performing nested kernel execution within a parallel processing subsystem. The technique involves enabling a parent thread to launch a nested child grid on the parallel processing subsystem, and enabling the parent thread to perform a thread synchronization barrier on the child grid for proper execution semantics between the parent thread and the child grid. This technique advantageously enables the parallel processing subsystem to perform a richer set of programming constructs, such as conditionally executed and nested operations and externally defined library functions without the additional complexity of CPU involvement.

Status:
Application
Type:

Utility

Filling date:

5 Feb 2021

Issue date:

11 Nov 2021