Adobe Inc.
USING A PREDICTIVE MODEL TO AUTOMATICALLY ENHANCE AUDIO HAVING VARIOUS AUDIO QUALITY ISSUES

Abstract:

Operations of a method include receiving a request to enhance a new source audio. Responsive to the request, the new source audio is input into a prediction model that was previously trained. Training the prediction model includes providing a generative adversarial network including the prediction model and a discriminator. Training data is obtained including tuples of source audios and target audios, each tuple including a source audio and a corresponding target audio. During training, the prediction model generates predicted audios based on the source audios. Training further includes applying a loss function to the predicted audios and the target audios, where the loss function incorporates a combination of a spectrogram loss and an adversarial loss. The prediction model is updated to optimize that loss function. After training, based on the new source audio, the prediction model generates a new predicted audio as an enhanced version of the new source audio.
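As an illustration only, the combined loss described in the abstract might be sketched as follows. The abstract does not state the exact form of either term, so everything specific here is an assumption: the spectrogram loss is taken as a mean absolute difference between log-magnitude spectrograms, the adversarial loss as a least-squares generator loss on discriminator scores, and the function names, frame sizes, and `adv_weight` are hypothetical.

```python
import numpy as np

def stft_magnitude(audio, frame_len=256, hop=128):
    """Magnitude spectrogram from a Hann-windowed short-time Fourier transform."""
    window = np.hanning(frame_len)
    n_frames = 1 + (len(audio) - frame_len) // hop
    frames = np.stack(
        [audio[i * hop : i * hop + frame_len] * window for i in range(n_frames)]
    )
    return np.abs(np.fft.rfft(frames, axis=1))

def spectrogram_loss(predicted, target, eps=1e-6):
    """Assumed form: mean absolute difference of log-magnitude spectrograms."""
    log_pred = np.log(stft_magnitude(predicted) + eps)
    log_tgt = np.log(stft_magnitude(target) + eps)
    return float(np.mean(np.abs(log_pred - log_tgt)))

def adversarial_loss(disc_scores_on_predicted):
    """Assumed form: least-squares generator loss pushing scores toward 1 ("real")."""
    scores = np.asarray(disc_scores_on_predicted, dtype=float)
    return float(np.mean((scores - 1.0) ** 2))

def combined_loss(predicted, target, disc_scores_on_predicted, adv_weight=0.01):
    """Weighted sum of the two terms; adv_weight is a hypothetical hyperparameter."""
    return (
        spectrogram_loss(predicted, target)
        + adv_weight * adversarial_loss(disc_scores_on_predicted)
    )
```

In a training loop, the prediction model's parameters would be updated to reduce `combined_loss`, while the discriminator is trained separately to tell predicted audios from target audios.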

Status:

Application

Type:

Utility

Filing date:

30 Apr 2020

Issue date:

4 Nov 2021