International Business Machines Corporation
METRIC LEARNING OF SPEAKER DIARIZATION

Abstract:

A computer-implemented method includes obtaining training data including utterances of speakers in acoustic conditions, preparing at least one machine learning model, each machine learning model including a common embedding model for converting an utterance into a feature vector and a classification model for classifying the feature vector, and training, by using the training data, the machine learning model to perform classification by speaker and to perform classification by acoustic condition.
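
As an illustration only, the arrangement in the abstract can be sketched as a single model in which one shared embedding feeds two classification heads, one for speaker and one for acoustic condition, trained on the sum of the two cross-entropy losses. This is a minimal sketch under stated assumptions, not the claimed method: the linear embedding, the softmax heads, the summed loss, the toy four-dimensional "utterances", and every name (`MultiTaskModel`, `embed`, `step`) are hypothetical choices made for the example. The abstract also allows several models that share the embedding; a single two-head model is used here for brevity.

```python
import math
import random

def matvec(W, x):
    # W is a list of rows; returns the product W @ x as a list.
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]

def softmax(z):
    m = max(z)  # subtract the max for numerical stability
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]

class MultiTaskModel:
    """Hypothetical toy model: shared linear embedding plus two softmax heads."""

    def __init__(self, n_in, n_emb, n_speakers, n_conditions, seed=0):
        rng = random.Random(seed)
        def init(rows, cols):
            return [[rng.uniform(-0.1, 0.1) for _ in range(cols)] for _ in range(rows)]
        self.E = init(n_emb, n_in)          # common embedding model (assumed linear)
        self.S = init(n_speakers, n_emb)    # speaker classification head
        self.C = init(n_conditions, n_emb)  # acoustic-condition classification head

    def embed(self, x):
        # Converts an utterance (here a plain list of floats) into a feature vector.
        return matvec(self.E, x)

    def loss(self, x, spk, cond):
        f = self.embed(x)
        ps = softmax(matvec(self.S, f))
        pc = softmax(matvec(self.C, f))
        # Assumed joint objective: sum of the two cross-entropy terms.
        return -math.log(ps[spk]) - math.log(pc[cond])

    def step(self, x, spk, cond, lr=0.1):
        f = self.embed(x)
        ps = softmax(matvec(self.S, f))
        pc = softmax(matvec(self.C, f))
        # Gradient of cross-entropy w.r.t. logits is (probabilities - one-hot).
        gs = [p - (i == spk) for i, p in enumerate(ps)]
        gc = [p - (i == cond) for i, p in enumerate(pc)]
        # Both heads send gradient back into the shared feature vector.
        gf = [sum(self.S[i][j] * gs[i] for i in range(len(gs)))
              + sum(self.C[i][j] * gc[i] for i in range(len(gc)))
              for j in range(len(f))]
        for W, g in ((self.S, gs), (self.C, gc)):
            for i, gi in enumerate(g):
                for j, fj in enumerate(f):
                    W[i][j] -= lr * gi * fj
        for j, gj in enumerate(gf):
            for k, xk in enumerate(x):
                self.E[j][k] -= lr * gj * xk

# Toy training data (invented for the example): (features, speaker id, condition id).
data = [
    ([1.0, 0.0, 1.0, 0.0], 0, 0),
    ([1.0, 0.0, 0.0, 1.0], 0, 1),
    ([0.0, 1.0, 1.0, 0.0], 1, 0),
    ([0.0, 1.0, 0.0, 1.0], 1, 1),
]

model = MultiTaskModel(n_in=4, n_emb=3, n_speakers=2, n_conditions=2)
initial = sum(model.loss(x, s, c) for x, s, c in data) / len(data)
for _ in range(300):
    for x, s, c in data:
        model.step(x, s, c)
final = sum(model.loss(x, s, c) for x, s, c in data) / len(data)
print(f"mean joint loss before: {initial:.3f}, after: {final:.3f}")
```

In this sketch both heads update the same matrix `E`, so the feature vector returned by `embed` is shaped by speaker labels and acoustic-condition labels together. A real system would replace the linear map with a neural embedding over acoustic features.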

Status:

Application

Type:

Utility

Filing date:

3 Mar 2020

Issue date:

9 Sep 2021