International Business Machines Corporation
Generating acoustic sequences via neural networks using combined prosody info
Last updated:
Abstract:
An example system includes a processor to receive a linguistic sequence and a prosody info offset. The processor can generate, via a trained prosody info predictor, combined prosody info including a number of observations based on the linguistic sequence. The number of observations include linear combinations of statistical measures evaluating a prosodic component over a predetermined period of time. The processor can generate, via a trained neural network, an acoustic sequence based on the combined prosody info, the prosody info offset, and the linguistic sequence.
Status:
Grant
Type:
Utility
Filling date:
12 Sep 2019
Issue date:
3 May 2022