Microsoft Corporation
Speaker recognition/location using neural network

Last updated:

Abstract:

Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives an audio signal of utterances spoken by multiple persons. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location and speaker identification neural network. The neural network utilizes both the magnitude and phase information features to determine a change in the person speaking. Output comprising the determination of the change is received from the neural network. The output is then used to perform a speaker recognition function, speaker location function, or both.

Status:
Grant
Type:

Utility

Filling date:

27 Feb 2020

Issue date:

11 Jan 2022