Microsoft Corporation
SPEECH WAVEFORM GENERATION
Last updated:
Abstract:
A method and apparatus for generating a speech waveform. Fundamental frequency information, glottal features and vocal tract features associated with an input may be received, wherein the glottal features include a phase feature, a shape feature, and an energy feature (1310). A glottal waveform is generated based on the fundamental frequency information and the glottal features through a first neural network model (1320). A speech waveform is generated based on the glottal waveform and the vocal tract features through a second neural network model (1330).
Status:
Application
Type:
Utility
Filling date:
30 Sep 2018
Issue date:
24 Jun 2021