Microsoft Corporation
Speaker recognition/location using neural network
Last updated:
Abstract:
Computing devices and methods utilizing a joint speaker location/speaker identification neural network are provided. In one example a computing device receives an audio signal of utterances spoken by multiple persons. Magnitude and phase information features are extracted from the signal and inputted into a joint speaker location and speaker identification neural network. The neural network utilizes both the magnitude and phase information features to determine a change in the person speaking. Output comprising the determination of the change is received from the neural network. The output is then used to perform a speaker recognition function, speaker location function, or both.
Status:
Grant
Type:
Utility
Filling date:
27 Feb 2020
Issue date:
11 Jan 2022