Netflix, Inc.
TECHNIQUES FOR TRAINING A MULTITASK LEARNING MODEL TO ASSESS PERCEIVED AUDIO QUALITY

Last updated:

Abstract:

In various embodiments, a training application trains a multitask learning model to assess perceived audio quality. The training application computes a set of pseudo labels based on a first audio clip and multiple models. The set of pseudo labels specifies metric values for a set of metrics that are relevant to audio quality. The training application also computes a set of feature values for a set of audio features based on the first audio clip. The training application trains a multitask learning model based on the set of feature values and the set of pseudo labels to generate a trained multitask learning model. In operation, the trained multitask learning model maps different sets of feature values for the set of audio features to different sets of predicted labels. Each set of predicted labels specifies estimated metric values for the set of metrics.

Status:
Application
Type:

Utility

Filling date:

18 Jun 2020

Issue date:

11 Nov 2021