NVIDIA Corporation
TECHNIQUE FOR COMPUTATIONAL NESTED PARALLELISM
Last updated:
Abstract:
One embodiment of the present invention sets forth a technique for performing nested kernel execution within a parallel processing subsystem. The technique involves enabling a parent thread to launch a nested child grid on the parallel processing subsystem, and enabling the parent thread to perform a thread synchronization barrier on the child grid for proper execution semantics between the parent thread and the child grid. This technique advantageously enables the parallel processing subsystem to perform a richer set of programming constructs, such as conditionally executed and nested operations and externally defined library functions without the additional complexity of CPU involvement.
Status:
Application
Type:
Utility
Filling date:
5 Feb 2021
Issue date:
11 Nov 2021