Intel Corporation
Instructions and logic to perform floating point and integer operations for machine learning
Last updated:
Abstract:
A processing apparatus is provided comprising a multiprocessor having a multithreaded architecture. The multiprocessor can execute at least one single instruction to perform parallel mixed precision matrix operations. In one embodiment, the apparatus includes a memory interface and an array of multiprocessors coupled to the memory interface. At least one multiprocessor in the array of multiprocessors is configured to execute a fused multiply-add instruction in parallel across multiple threads.
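As a rough illustration of what a mixed precision matrix fused multiply-add computes, the sketch below emulates D = A x B + C in plain Python, with inputs rounded to half precision and the running sum kept in single precision. The function names (to_fp16, to_fp32, mixed_precision_matmul_add) and the choice of fp16 inputs with fp32 accumulation are assumptions made for illustration; the abstract does not specify formats, and this is not the patented instruction or any hardware implementation. Each (i, j) output element stands in for the work one thread might do, and the rounding only approximates a true single-rounding fused operation.

```python
import struct


def to_fp16(x: float) -> float:
    """Round x to the nearest IEEE 754 half-precision value."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_fp32(x: float) -> float:
    """Round x to the nearest IEEE 754 single-precision value."""
    return struct.unpack("<f", struct.pack("<f", x))[0]


def mixed_precision_matmul_add(a, b, c):
    """Emulate D = A x B + C with fp16 inputs and fp32 accumulation.

    Hypothetical helper: each (i, j) element models one thread's work.
    """
    rows, inner, cols = len(a), len(b), len(b[0])
    d = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = to_fp32(c[i][j])  # accumulator starts from C
            for k in range(inner):
                # The product of two fp16 values is exact in Python's
                # double precision; the sum is rounded once to fp32,
                # approximating a fused multiply-add step.
                product = to_fp16(a[i][k]) * to_fp16(b[k][j])
                acc = to_fp32(product + acc)
            d[i][j] = acc
    return d


if __name__ == "__main__":
    a = [[1.0, 2.0], [3.0, 4.0]]
    b = [[5.0, 6.0], [7.0, 8.0]]
    c = [[1.0, 1.0], [1.0, 1.0]]
    print(mixed_precision_matmul_add(a, b, c))
```

Keeping the accumulator wider than the inputs is the usual motivation for mixed precision: the narrow inputs save memory and bandwidth, while the wider sum limits the rounding error that builds up over a long dot product.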
Status:
Grant
Type:
Utility
Filing date:
5 Feb 2021
Issue date:
3 Aug 2021