Adobe Inc.
Real-time speaker-dependent neural vocoder

Last updated: 27 Jul 2021

Abstract:

Techniques for a recursive deep-learning approach for performing speech synthesis using a repeatable structure that splits an input tensor into a left half and right half similar to the operation of the Fast Fourier Transform, performs a 1-D convolution on each respective half, performs a summation and then applies a post-processing function. The repeatable structure may be utilized in a series configuration to operate as a vocoder or perform other speech processing functions.

Status:

Grant

Type:

Utility

Filling date:

22 Aug 2018

Issue date:

8 Sep 2020

Full patent description

Patent application document

Adobe Inc. Real-time speaker-dependent neural vocoder

Abstract:

Adobe Inc.
Real-time speaker-dependent neural vocoder