International Business Machines Corporation
CUSTOMIZATION OF RECURRENT NEURAL NETWORK TRANSDUCERS FOR SPEECH RECOGNITION

Last updated: 13 Jul 2022

Abstract:

A computer-implemented method for customizing a recurrent neural network transducer (RNN-T) is provided. The computer implemented method includes synthesizing first domain audio data from first domain text data, and feeding the synthesized first domain audio data into a trained encoder of the recurrent neural network transducer (RNN-T) having an initial condition, wherein the encoder is updated using the synthesized first domain audio data and the first domain text data. The computer implemented method further includes synthesizing second domain audio data from second domain text data, and feeding the synthesized second domain audio data into the updated encoder of the recurrent neural network transducer (RNN-T), wherein the prediction network is updated using the synthesized second domain audio data and the second domain text data. The computer implemented method further includes restoring the updated encoder to the initial condition.

Status:

Application

Type:

Utility

Filling date:

29 Dec 2020

Issue date:

30 Jun 2022

Full patent description

Patent application document

International Business Machines Corporation CUSTOMIZATION OF RECURRENT NEURAL NETWORK TRANSDUCERS FOR SPEECH RECOGNITION

Abstract:

International Business Machines Corporation
CUSTOMIZATION OF RECURRENT NEURAL NETWORK TRANSDUCERS FOR SPEECH RECOGNITION