Spotify Technology S.A.
Singing voice separation with deep u-net convolutional networks

Last updated: 7 Jan 2022

Abstract:

A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.

Status:

Grant

Type:

Utility

Filling date:

6 Aug 2018

Issue date:

16 Feb 2021

Full patent description

Patent application document

Spotify Technology S.A. Singing voice separation with deep u-net convolutional networks

Abstract:

Spotify Technology S.A.
Singing voice separation with deep u-net convolutional networks