Microsoft Corporation
SPEAKER ADAPTATION FOR ATTENTION-BASED ENCODER-DECODER

Last updated: 18 May 2022

Abstract:

Embodiments are associated with a speaker-independent attention-based encoder-decoder model to classify output tokens based on input speech frames, the speaker-independent attention-based encoder-decoder model associated with a first output distribution, and a speaker-dependent attention-based encoder-decoder model to classify output tokens based on input speech frames, the speaker-dependent attention-based encoder-decoder model associated with a second output distribution. The second attention-based encoder-decoder model is trained to classify output tokens based on input speech frames of a target speaker and simultaneously trained to maintain a similarity between the first output distribution and the second output distribution.

Status:

Application

Type:

Utility

Filling date:

5 Jan 2022

Issue date:

28 Apr 2022

Full patent description

Patent application document

Microsoft Corporation SPEAKER ADAPTATION FOR ATTENTION-BASED ENCODER-DECODER

Abstract:

Microsoft Corporation
SPEAKER ADAPTATION FOR ATTENTION-BASED ENCODER-DECODER