International Business Machines Corporation
METRIC LEARNING OF SPEAKER DIARIZATION
Last updated:
Abstract:
A computer-implemented method includes obtaining training data including utterances of speakers in acoustic conditions, preparing at least one machine learning model, each machine learning model including a common embedding model for converting an utterance into a feature vector and a classification model for classifying the feature vector, and training, by using the training data, the machine learning model to perform classification by speaker and to perform classification by acoustic condition.
Status:
Application
Type:
Utility
Filling date:
3 Mar 2020
Issue date:
9 Sep 2021