Alibaba Group Holding Limited
VOICEPRINT RECOGNITION METHOD AND DEVICE BASED ON MEMORY BOTTLENECK FEATURE

Last updated:

Abstract:

Implementations of the present specification provide a voiceprint recognition method and device. The method includes: extracting a first spectral feature from speaker audio; inputting the speaker audio to a memory deep neural network (DNN), and extracting a bottleneck feature from a bottleneck layer of the memory DNN, where the memory DNN includes at least one temporal recurrent layer and the bottleneck layer, an output of the at least one temporal recurrent layer is connected to the bottleneck layer; forming an acoustic feature of the speaker audio based on the first spectral feature and the bottleneck feature; extracting an identity authentication vector corresponding to the speaker audio based on the acoustic feature; and performing speaker recognition by using a classification model and based on an identity authentication vector (i-vector).

Status:
Application
Type:

Utility

Filling date:

18 Jun 2020

Issue date:

8 Oct 2020