International Business Machines Corporation
CHUNKING AND OVERLAP DECODING STRATEGY FOR STREAMING RNN TRANSDUCERS FOR SPEECH RECOGNITION

Last updated:

Abstract:

A computer-implemented method is provided for improving accuracy recognition of digital speech. The method includes receiving the digital speech. The method further includes splitting the digital speech into overlapping chunks. The method also includes computing a bidirectional encoder embedding of each of the overlapping chunks to obtain bidirectional encoder embeddings. The method additionally includes combining the bidirectional encoder embeddings. The method further includes interpreting, by a speech recognition system, the digital speech using the combined bidirectional encoder embeddings.

Status:
Application
Type:

Utility

Filling date:

26 Feb 2021

Issue date:

1 Sep 2022