Microsoft Corporation
Paragraph synthesis with cross utterance features for neural TTS

Last updated: 14 Sep 2022

Abstract:

The present disclosure provides a method and apparatus for generating speech through neural text-to-speech (TTS) synthesis. A text input may be obtained. A phone feature of the text input may be generated. Context features of the text input may be generated based on a set of sentences associated with the text input. A speech waveform corresponding to the text input may be generated based on the phone feature and the context features.
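The data flow described in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the function names (`phone_feature`, `context_features`, `condition`, `synthesize`) are hypothetical, the encoders are toy stand-ins rather than neural networks, and waveform generation is omitted. It only shows how a phone feature of the text input might be combined with context features derived from a set of associated sentences.

```python
from typing import List


def phone_feature(text: str) -> List[int]:
    """Toy stand-in for a phone encoder: one integer ID per ASCII letter."""
    return [ord(c) - ord("a") for c in text.lower() if "a" <= c <= "z"]


def context_features(sentences: List[str]) -> List[float]:
    """Toy stand-in for a context encoder.

    Summarizes the associated sentences as
    [mean words per sentence, number of sentences].
    """
    if not sentences:
        return [0.0, 0.0]
    word_counts = [len(s.split()) for s in sentences]
    return [sum(word_counts) / len(word_counts), float(len(sentences))]


def condition(phones: List[int], context: List[float]) -> List[List[float]]:
    """Attach the same context vector to every phone position."""
    return [[float(p)] + context for p in phones]


def synthesize(text: str, sentences: List[str]) -> List[List[float]]:
    """Return context-conditioned phone frames (hypothetical interface).

    A real system would pass these frames to an acoustic model and a
    vocoder to obtain a speech waveform; that step is not sketched here.
    """
    return condition(phone_feature(text), context_features(sentences))
```

In this sketch the context vector is simply repeated at each phone position, which is one simple way to let sentence-level information influence every step of the output; the disclosure itself may combine the features differently.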

Status:

Application

Type:

Utility

Filing date:

17 Jun 2020

Issue date:

1 Sep 2022
