NVIDIA Corporation
TECHNIQUES FOR ORCHESTRATING STAGES OF THREAD SYNCHRONIZATION
Last updated:
Abstract:
Apparatuses, systems, and techniques to execute data-dependent parallel operations in one or more programs utilizing an application programming interface to perform parallel computing, such as CUDA, without relying on a synchronization operation between said one or more programs. For example, at least one embodiment pertains to processors or computing systems used to determine which thread in a group of threads finishes modifying shared data last, and that thread is selected to perform additional data-dependent computations from said group of threads.
Status:
Application
Type:
Utility
Filling date:
19 Mar 2020
Issue date:
23 Sep 2021