Intuit Inc.
Contextual multi-channel speech to text

Abstract:

A method for improving a transcription may include identifying, in the transcription, reliable channel tokens of an utterance of a reliable channel and an unreliable channel token of an utterance of an unreliable channel, and generating, using a machine learning model, a vector embedding for the unreliable channel token and vector embeddings for the reliable channel tokens. The method may further include calculating vector distances between the vector embedding of the unreliable channel token and the vector embeddings of the reliable channel tokens, and generating, for the unreliable channel token and using the vector distances, a score corresponding to a reliable channel token. The method may further include determining that the score is within a threshold score and, in response, replacing, in the transcription, the unreliable channel token with the reliable channel token.
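
The steps in the abstract (embed, measure distances, score, threshold, replace) can be illustrated with a minimal sketch. It is not the patented implementation. The names embed, cosine_distance, and repair_token are hypothetical, and a character-bigram count vector stands in for the machine learning model, which the abstract does not specify. The sketch also assumes that the score is simply the smallest cosine distance and that "within the threshold score" means less than or equal to the threshold.

```python
import math
from collections import Counter


def embed(token: str, n: int = 2) -> Counter:
    """Toy stand-in for the ML model (assumption): character n-gram counts."""
    padded = f"#{token.lower()}#"
    return Counter(padded[i:i + n] for i in range(len(padded) - n + 1))


def cosine_distance(a: Counter, b: Counter) -> float:
    """Return 1 - cosine similarity of two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a)
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return 1.0 - dot / norm if norm else 1.0


def repair_token(unreliable_token: str, reliable_tokens: list[str],
                 threshold: float = 0.5) -> str:
    """Replace the unreliable token with its closest reliable token
    when the distance-based score is within the threshold (assumed: <=)."""
    target = embed(unreliable_token)
    best_token, best_score = unreliable_token, float("inf")
    for candidate in reliable_tokens:
        score = cosine_distance(target, embed(candidate))
        if score < best_score:
            best_token, best_score = candidate, score
    return best_token if best_score <= threshold else unreliable_token


# Hypothetical usage: tokens from the reliable channel guide the repair
# of a token from the unreliable channel.
reliable = ["please", "send", "the", "invoice"]
print(repair_token("invoyce", reliable))
print(repair_token("zzz", reliable))
```

In this sketch a token with no sufficiently close reliable counterpart is left unchanged, which is one plausible reading of the threshold step. A real system would presumably use learned embeddings that capture context rather than spelling overlap.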

Status:

Grant

Type:

Utility

Filing date:

24 Jan 2020

Issue date:

9 Nov 2021