Microsoft Corporation
TOKEN-POSITION HANDLING FOR SEQUENCE BASED NEURAL NETWORKS
Last updated:
Abstract:
Embodiments of the present disclosure include a method for token-position handling comprising: processing a first sequence of tokens to produce a second sequence of tokens, wherein the second sequence of tokens has a smaller number of tokens than the first sequence of tokens; masking at least some tokens in the second sequence to produce masked tokens; moving the masked tokens to the beginning of the second sequence to produce a third sequence; encoding tokens in the third sequence into a set of numeric vectors in a first array; and processing the first array in a transformer neural network to determine correlations among the third sequence, the processing the first array producing a second array.
Status:
Application
Type:
Utility
Filling date:
14 Apr 2020
Issue date:
14 Oct 2021