International Business Machines Corporation
CHUNKING AND OVERLAP DECODING STRATEGY FOR STREAMING RNN TRANSDUCERS FOR SPEECH RECOGNITION
Last updated:
Abstract:
A computer-implemented method is provided for improving accuracy recognition of digital speech. The method includes receiving the digital speech. The method further includes splitting the digital speech into overlapping chunks. The method also includes computing a bidirectional encoder embedding of each of the overlapping chunks to obtain bidirectional encoder embeddings. The method additionally includes combining the bidirectional encoder embeddings. The method further includes interpreting, by a speech recognition system, the digital speech using the combined bidirectional encoder embeddings.
Status:
Application
Type:
Utility
Filling date:
26 Feb 2021
Issue date:
1 Sep 2022