Microsoft Corporation
SPEECH WAVEFORM GENERATION

Abstract:

A method and apparatus for generating a speech waveform are provided. Fundamental frequency information, glottal features, and vocal tract features associated with an input are received, wherein the glottal features include a phase feature, a shape feature, and an energy feature (1310). A glottal waveform is generated from the fundamental frequency information and the glottal features through a first neural network model (1320). A speech waveform is then generated from the glottal waveform and the vocal tract features through a second neural network model (1330).
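
The two-stage data flow in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not specify network architectures, feature dimensions, or training, so every name here (GlottalModel, SpeechModel, FRAME_LEN, generate_speech_frame) is hypothetical, each "model" is a single untrained dense layer with random weights, and each glottal feature is assumed to be one scalar per frame. The sketch only shows which inputs feed which stage.

```python
import math
import random

FRAME_LEN = 8  # samples per frame; hypothetical, kept small for illustration


def make_layer(n_in, n_out, rng):
    """Return (weights, biases) for one dense layer with small random weights."""
    weights = [[rng.uniform(-0.5, 0.5) for _ in range(n_in)] for _ in range(n_out)]
    biases = [0.0] * n_out
    return weights, biases


def dense_tanh(layer, x):
    """Apply one dense layer followed by tanh, so outputs lie in [-1, 1]."""
    weights, biases = layer
    return [math.tanh(sum(w * v for w, v in zip(row, x)) + b)
            for row, b in zip(weights, biases)]


class GlottalModel:
    """Untrained stand-in for the first neural network model (step 1320)."""

    def __init__(self, rng):
        # Assumed inputs: f0, phase, shape, energy (one scalar each per frame).
        self.layer = make_layer(4, FRAME_LEN, rng)

    def __call__(self, f0, phase, shape, energy):
        return dense_tanh(self.layer, [f0, phase, shape, energy])


class SpeechModel:
    """Untrained stand-in for the second neural network model (step 1330)."""

    def __init__(self, n_vocal_tract, rng):
        # Inputs: the glottal waveform frame concatenated with vocal tract features.
        self.layer = make_layer(FRAME_LEN + n_vocal_tract, FRAME_LEN, rng)

    def __call__(self, glottal_frame, vocal_tract):
        return dense_tanh(self.layer, glottal_frame + vocal_tract)


def generate_speech_frame(f0, glottal, vocal_tract, glottal_model, speech_model):
    """Run both stages for one frame of received features (step 1310)."""
    glottal_frame = glottal_model(
        f0, glottal["phase"], glottal["shape"], glottal["energy"]
    )
    return speech_model(glottal_frame, vocal_tract)


rng = random.Random(0)  # fixed seed so the sketch is reproducible
glottal_model = GlottalModel(rng)
speech_model = SpeechModel(n_vocal_tract=3, rng=rng)

frame = generate_speech_frame(
    f0=0.2,  # normalized fundamental frequency; the scaling is an assumption
    glottal={"phase": 0.1, "shape": 0.5, "energy": 0.8},
    vocal_tract=[0.3, -0.1, 0.4],
    glottal_model=glottal_model,
    speech_model=speech_model,
)
print(len(frame))
```

The point of the split is that the vocal tract features enter only at the second stage, while the fundamental frequency and glottal features enter only at the first; the glottal waveform is the sole link between the two models.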

Status:

Application

Type:

Utility

Filing date:

30 Sep 2018

Issue date:

24 Jun 2021