Microsoft Corporation
Paragraph synthesis with cross utterance features for neural TTS
Last updated:
Abstract:
The present disclosure provides a method and apparatus for generating speech through neural text-to-speech (TTS) synthesis. A text input may be obtained. A phone feature of the text input may be generated. Context features of the text input may be generated based on a set of sentences associated with the text input. A speech waveform corresponding to the text input may be generated based on the phone feature and the context features.
Status:
Application
Type:
Utility
Filling date:
17 Jun 2020
Issue date:
1 Sep 2022