International Business Machines Corporation
Half-precision floating-point arrays at low overhead

Last updated:

Abstract:

Methods and systems of matrix multiplication are described. In an example, a processor can multiply a first entry of a first vector of a first data array with a second vector of a second data array to generate a third vector of a third data array. The processor can store the third vector of the third data array in the second register file. The processor can multiply a second entry of the first vector with the second vector to generate a fourth vector of the third data array. The processor can store the fourth vector of the third data array in the second register file. The processor can combine vectors of the third data array that are stored in the second register file to produce the third data array.

Status:
Grant
Type:

Utility

Filling date:

16 Aug 2019

Issue date:

22 Mar 2022