Microsoft Corporation
SPREAD NEURAL NETWORKS

Last updated:

Abstract:

Techniques for training neural networks are provided. According to one set of embodiments, a first array is processed in a spreading component to produce a second array, where a first dimension of the first array corresponds to at least one sequence of approximately orthogonal numeric vectors representing tokens, and where the spreading component combines values along the first dimension. The second array is processed in a transformer neural network to determine correlations between the sequence, which produces a third array. One or more batches of the third array are processed in a de-spreading component to produce a fourth array.

Status:
Application
Type:

Utility

Filling date:

14 Apr 2020

Issue date:

14 Oct 2021