International Business Machines Corporation
TEXT TO SPEECH PROMPT TUNING BY EXAMPLE
Last updated:
Abstract:
According to one embodiment, a method, computer system, and computer program product for customizing the rendering of a synthesized speech prompt is provided. The present invention may include extracting prosodic information from a received audio recording of a prompt by parsing the text corresponding with the prompt and generating phonetic units, aligning the phonetic units with the audio recording, and calculating, based on the alignment, prosodic values for the phonetic units. The invention may further include adapting the prosodic values to match a text-to-speech voice in use, and then synthesizing speech for the prompt based upon the adapted prosodic information.
Status:
Application
Type:
Utility
Filling date:
4 Mar 2020
Issue date:
9 Sep 2021