Alibaba Group Holding Limited
VOICEPRINT RECOGNITION METHOD AND DEVICE BASED ON MEMORY BOTTLENECK FEATURE

Last updated: 11 Aug 2021

Abstract:

Implementations of the present specification provide a voiceprint recognition method and device. The method includes: extracting a first spectral feature from speaker audio; inputting the speaker audio to a memory deep neural network (DNN) and extracting a bottleneck feature from a bottleneck layer of the memory DNN, where the memory DNN includes at least one temporal recurrent layer and the bottleneck layer, and an output of the at least one temporal recurrent layer is connected to the bottleneck layer; forming an acoustic feature of the speaker audio based on the first spectral feature and the bottleneck feature; extracting an identity authentication vector (i-vector) corresponding to the speaker audio based on the acoustic feature; and performing speaker recognition by using a classification model based on the i-vector.
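
The feature-forming steps of the abstract can be illustrated with a minimal sketch. It is not the implementation described in the specification: the recurrent layer is a plain tanh recurrence standing in for the "temporal recurrent layer", the weights are random rather than trained, the network is fed per-frame spectral vectors instead of raw audio, and combining the two features by frame-wise concatenation is an assumption. All function names and dimensions are hypothetical. The i-vector extraction and classification steps are omitted.

```python
import math
import random

def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector (list of floats)."""
    return [sum(w * v for w, v in zip(row, vector)) for row in matrix]

def random_matrix(rows, cols, rng):
    """Small random weights; placeholders for trained parameters (assumption)."""
    return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]

def bottleneck_features(frames, w_in, w_rec, w_bn):
    """Run frames through a simple recurrent layer whose output feeds a
    low-dimensional linear bottleneck layer; return one vector per frame."""
    hidden = [0.0] * len(w_rec)  # recurrent state carried across frames
    outputs = []
    for frame in frames:
        pre = [a + b for a, b in zip(matvec(w_in, frame), matvec(w_rec, hidden))]
        hidden = [math.tanh(p) for p in pre]
        outputs.append(matvec(w_bn, hidden))  # bottleneck layer activation
    return outputs

def acoustic_features(spectral, bottleneck):
    """Concatenate spectral and bottleneck features frame by frame (assumed
    way of forming the acoustic feature)."""
    if len(spectral) != len(bottleneck):
        raise ValueError("feature sequences must have the same number of frames")
    return [s + b for s, b in zip(spectral, bottleneck)]

# Hypothetical sizes: 5 frames, 13-dim spectral vectors, 8-dim bottleneck.
rng = random.Random(0)
NUM_FRAMES, SPECTRAL_DIM, HIDDEN_DIM, BOTTLENECK_DIM = 5, 13, 32, 8

# Random numbers stand in for real spectral features of speaker audio.
spectral = [[rng.gauss(0, 1) for _ in range(SPECTRAL_DIM)] for _ in range(NUM_FRAMES)]
w_in = random_matrix(HIDDEN_DIM, SPECTRAL_DIM, rng)
w_rec = random_matrix(HIDDEN_DIM, HIDDEN_DIM, rng)
w_bn = random_matrix(BOTTLENECK_DIM, HIDDEN_DIM, rng)

bottleneck = bottleneck_features(spectral, w_in, w_rec, w_bn)
acoustic = acoustic_features(spectral, bottleneck)
print(len(acoustic), len(acoustic[0]))  # 5 21
```

Each acoustic vector here has 13 + 8 = 21 dimensions. In a full pipeline these per-frame vectors would be the input to i-vector extraction, followed by a classification model for the speaker decision.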

Status:

Application

Type:

Utility

Filing date:

18 Jun 2020

Issue date:

8 Oct 2020

Full patent description

Patent application document
