Adobe Inc.
Real-time speaker-dependent neural vocoder

Last updated:

Abstract:

Techniques for a recursive deep-learning approach for performing speech synthesis using a repeatable structure that splits an input tensor into a left half and right half similar to the operation of the Fast Fourier Transform, performs a 1-D convolution on each respective half, performs a summation and then applies a post-processing function. The repeatable structure may be utilized in a series configuration to operate as a vocoder or perform other speech processing functions.

Status:
Grant
Type:

Utility

Filling date:

22 Aug 2018

Issue date:

8 Sep 2020