Spotify Technology S.A.
SINGING VOICE SEPARATION WITH DEEP U-NET CONVOLUTIONAL NETWORKS

Last updated: 19 Jan 2022

Abstract:

A system, method and computer product for training a neural network system. The method comprises applying an audio signal to the neural network system, the audio signal including a vocal component and a non-vocal component. The method also comprises comparing an output of the neural network system to a target signal, and adjusting at least one parameter of the neural network system to reduce a result of the comparing, for training the neural network system to estimate one of the vocal component and the non-vocal component. In one example embodiment, the system comprises a U-Net architecture. After training, the system can estimate vocal or instrumental components of an audio signal, depending on which type of component the system is trained to estimate.
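The training loop described in the abstract (apply a mixed signal, compare the output to a target, adjust parameters to reduce the difference) can be sketched as follows. This is a toy illustration under stated assumptions, not the patented implementation: a per-bin soft mask stands in for the U-Net, the comparison is a mean absolute (L1) difference, and the magnitude values and the names `apply_mask`, `l1_loss`, and `train_step` are all hypothetical.

```python
# Toy sketch: a learnable per-bin soft mask plays the role of the neural
# network. All names and numbers are hypothetical, chosen for illustration.

def apply_mask(mask, mixture):
    """Estimate the target component by scaling each mixture bin."""
    return [m * x for m, x in zip(mask, mixture)]


def l1_loss(estimate, target):
    """Assumed comparison: mean absolute difference between output and target."""
    return sum(abs(e - t) for e, t in zip(estimate, target)) / len(target)


def train_step(mask, mixture, target, lr=0.05):
    """Adjust each mask parameter one step in the direction that reduces the loss."""
    estimate = apply_mask(mask, mixture)
    updated = []
    for m, x, e, t in zip(mask, mixture, estimate, target):
        # Subgradient of |m * x - t| with respect to m is sign(m * x - t) * x.
        sign = (e > t) - (e < t)
        grad = sign * x / len(target)
        # Keep the mask in [0, 1] so it stays a soft mask.
        updated.append(min(1.0, max(0.0, m - lr * grad)))
    return updated


# Hypothetical magnitudes for four frequency bins of one frame.
mixture = [1.0, 2.0, 4.0, 2.0]   # vocal + non-vocal components
vocals = [0.5, 2.0, 1.0, 0.0]    # target signal: the vocal component alone

mask = [0.5] * len(mixture)
initial_loss = l1_loss(apply_mask(mask, mixture), vocals)

for _ in range(200):
    mask = train_step(mask, mixture, vocals)

final_loss = l1_loss(apply_mask(mask, mixture), vocals)
print(f"loss before: {initial_loss:.3f}, after: {final_loss:.3f}")
```

Using the non-vocal magnitudes as the target instead would train the same loop to estimate the instrumental component, which mirrors the abstract's point that the trained system estimates whichever component it was trained on.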

Status:

Application

Type:

Utility

Filing date:

28 Dec 2020

Issue date:

19 Aug 2021
