Microsoft Corporation
Paragraph synthesis with cross utterance features for neural TTS

Last updated:

Abstract:

The present disclosure provides a method and apparatus for generating speech through neural text-to-speech (TTS) synthesis. A text input may be obtained. A phone feature of the text input may be generated. Context features of the text input may be generated based on a set of sentences associated with the text input. A speech waveform corresponding to the text input may be generated based on the phone feature and the context features.

Status:
Application
Type:

Utility

Filling date:

17 Jun 2020

Issue date:

1 Sep 2022