International Business Machines Corporation
Generating acoustic sequences via neural networks using combined prosody info

Abstract:

An example system includes a processor to receive a linguistic sequence and a prosody info offset. The processor can generate, via a trained prosody info predictor, combined prosody info including a number of observations based on the linguistic sequence. The number of observations include linear combinations of statistical measures evaluating a prosodic component over a predetermined period of time. The processor can generate, via a trained neural network, an acoustic sequence based on the combined prosody info, the prosody info offset, and the linguistic sequence.
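As a rough illustration of what "linear combinations of statistical measures evaluating a prosodic component over a predetermined period of time" could look like, the sketch below combines the mean and standard deviation of a prosodic component measured over one window. Everything specific here is an assumption made for illustration and is not taken from the patent: the choice of log-pitch as the component, the choice of mean and standard deviation as the measures, the weights, the function names, and the additive treatment of the prosody info offset.

```python
from statistics import fmean, pstdev


def combined_observation(values, w_mean, w_std):
    """Return one observation: a linear combination of two statistical
    measures (mean and population standard deviation) of a prosodic
    component sampled over a fixed window.

    The measures and weights are hypothetical; the abstract does not
    specify which ones are used.
    """
    if not values:
        raise ValueError("values must be non-empty")
    return w_mean * fmean(values) + w_std * pstdev(values)


def apply_offset(observations, offsets):
    """Shift each observation by a user-supplied offset.

    Assumption: the offset is applied additively, one value per observation.
    """
    if len(observations) != len(offsets):
        raise ValueError("observations and offsets must have equal length")
    return [obs + off for obs, off in zip(observations, offsets)]


# Hypothetical log-pitch samples for one window (for example, one sentence).
log_pitch = [5.0, 5.2, 5.4, 5.2]

# Two observations: mean plus and minus two standard deviations,
# which loosely bracket the pitch range of the window.
upper = combined_observation(log_pitch, 1.0, 2.0)
lower = combined_observation(log_pitch, 1.0, -2.0)

# Raising both observations by the same amount shifts the bracket upward.
shifted = apply_offset([upper, lower], [0.1, 0.1])
```

In a system like the one described, observations of this kind would be predicted from the linguistic sequence rather than measured directly, and then passed, together with the offset and the linguistic sequence, to the network that generates the acoustic sequence.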

Status:

Grant

Type:

Utility

Filing date:

12 Sep 2019

Issue date:

3 May 2022