NVIDIA Corporation
TECHNIQUES FOR ORCHESTRATING STAGES OF THREAD SYNCHRONIZATION

Last updated:

Abstract:

Apparatuses, systems, and techniques to execute data-dependent parallel operations in one or more programs that use an application programming interface for parallel computing, such as CUDA, without relying on a synchronization operation between said one or more programs. For example, at least one embodiment pertains to processors or computing systems that determine which thread in a group of threads is the last to finish modifying shared data, and that select that thread from said group to perform additional data-dependent computations.
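
The idea in the abstract can be illustrated with a minimal CUDA sketch. It is an assumption-laden illustration of the general "last one to finish does the follow-up work" idiom, not the method claimed in the application: it works at thread-block granularity, and all names (sum_last_block, blocks_done, sum_on_device, kThreads) and the choice of a sum reduction are hypothetical. Each block publishes a partial result, takes a ticket from an atomic counter, and the block whose ticket shows it finished last performs the dependent second stage inside the same kernel launch.

```cuda
#include <cuda_runtime.h>
#include <vector>

// Hypothetical names; block size is assumed to be a power of two.
constexpr int kThreads = 256;

// Counts how many blocks have published their partial result.
__device__ unsigned int blocks_done = 0;

__global__ void sum_last_block(const float* in, int n,
                               volatile float* partial, float* out) {
    __shared__ float scratch[kThreads];
    __shared__ bool is_last;

    // Stage 1: each block reduces its grid-strided slice of the input.
    float acc = 0.0f;
    for (int i = blockIdx.x * blockDim.x + threadIdx.x; i < n;
         i += gridDim.x * blockDim.x)
        acc += in[i];
    scratch[threadIdx.x] = acc;
    __syncthreads();
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (threadIdx.x < s) scratch[threadIdx.x] += scratch[threadIdx.x + s];
        __syncthreads();
    }

    if (threadIdx.x == 0) {
        partial[blockIdx.x] = scratch[0];
        // Make the partial result visible to other blocks before counting.
        __threadfence();
        // atomicInc returns the old value; the last block sees gridDim.x - 1.
        unsigned int ticket = atomicInc(&blocks_done, gridDim.x);
        is_last = (ticket == gridDim.x - 1);
    }
    __syncthreads();

    // Stage 2: only the block that finished last combines all partials.
    if (is_last && threadIdx.x == 0) {
        float total = 0.0f;
        for (unsigned int b = 0; b < gridDim.x; ++b) total += partial[b];
        *out = total;
        blocks_done = 0;  // reset the counter for the next launch
    }
}

// Host helper (error checking omitted for brevity).
float sum_on_device(const std::vector<float>& host) {
    const int n = static_cast<int>(host.size());
    const int blocks = 8;  // arbitrary grid size chosen for this sketch
    float *in, *partial, *out;
    cudaMalloc(&in, n * sizeof(float));
    cudaMalloc(&partial, blocks * sizeof(float));
    cudaMalloc(&out, sizeof(float));
    cudaMemcpy(in, host.data(), n * sizeof(float), cudaMemcpyHostToDevice);
    sum_last_block<<<blocks, kThreads>>>(in, n, partial, out);
    float result = 0.0f;
    // A blocking copy on the default stream also waits for the kernel.
    cudaMemcpy(&result, out, sizeof(float), cudaMemcpyDeviceToHost);
    cudaFree(in);
    cudaFree(partial);
    cudaFree(out);
    return result;
}
```

Calling sum_on_device on a host vector returns the sum computed in a single launch. The design point is that no second kernel launch or host-side synchronization separates the two stages: the fence followed by the atomic ticket lets the last finisher know that every other block's partial result is already visible.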

Status:

Application

Type:

Utility

Filing date:

19 Mar 2020

Issue date:

23 Sep 2021