QUALCOMM Incorporated
Method, apparatus, and system for an architecture for machine learning acceleration
Last updated:
Abstract:
A method, apparatus, and system for an architecture for machine learning acceleration is presented. An apparatus includes a plurality of processing elements, each including a tightly-coupled memory, and a memory system coupled to the processing elements. A global synchronization manager is coupled to the plurality of the processing elements and to the memory system. The processing elements do not implement a coherency protocol with respect to the memory system. The processing elements implement direct memory access with respect to the memory system, and the global synchronization manager is configured to synchronize operations of the plurality of processing elements through the TCMs.
Status:
Grant
Type:
Utility
Filling date:
29 Aug 2019
Issue date:
18 May 2021