Microsoft Corporation
SPEAKER ADAPTATION FOR ATTENTION-BASED ENCODER-DECODER

Last updated:

Abstract:

Embodiments are associated with a speaker-independent attention-based encoder-decoder model that classifies output tokens based on input speech frames and is associated with a first output distribution, and a speaker-dependent attention-based encoder-decoder model that classifies output tokens based on input speech frames and is associated with a second output distribution. The speaker-dependent attention-based encoder-decoder model is trained to classify output tokens based on input speech frames of a target speaker and is simultaneously trained to maintain a similarity between the first output distribution and the second output distribution.
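
The joint objective described in the abstract can be illustrated with a minimal sketch. The abstract does not say how similarity between the two output distributions is measured or how the two goals are weighted. The sketch below assumes a Kullback-Leibler divergence term and a hypothetical interpolation weight `rho`. All function names are illustrative and are not taken from the application.

```python
import math


def softmax(logits):
    """Convert raw scores into a probability distribution over output tokens."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def cross_entropy(sd_probs, target_index):
    """Token classification loss on target-speaker data."""
    return -math.log(sd_probs[target_index])


def kl_divergence(p, q):
    """KL(p || q); assumed here as the similarity measure (not stated in the source)."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0.0)


def adaptation_loss(si_logits, sd_logits, target_index, rho=0.5):
    """Per-token loss for the speaker-dependent (SD) model.

    si_logits: scores from the frozen speaker-independent (SI) model.
    sd_logits: scores from the SD model being adapted.
    rho: hypothetical weight in [0, 1]; larger values keep the SD model
         closer to the SI output distribution.
    """
    si_probs = softmax(si_logits)  # first output distribution
    sd_probs = softmax(sd_logits)  # second output distribution
    ce = cross_entropy(sd_probs, target_index)
    kl = kl_divergence(si_probs, sd_probs)
    return (1.0 - rho) * ce + rho * kl


if __name__ == "__main__":
    si = [1.0, 2.0, 3.0]
    sd = [0.5, 2.5, 3.5]
    print(adaptation_loss(si, sd, target_index=2, rho=0.5))
```

Under these assumptions, setting `rho` to 0 reduces the loss to plain fine-tuning on the target speaker, while setting it to 1 only penalizes drift from the speaker-independent model.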

Status:

Application

Type:

Utility

Filing date:

5 Jan 2022

Issue date:

28 Apr 2022