Medallia, Inc.
Use of ASR confidence to improve reliability of automatic audio redaction
Last updated:
Abstract:
A method includes removing sensitive information from an audio recording. The method includes receiving a digitally encoded audio recording; transcoding, if necessary, the audio recording to a pre-defined audio format; identifying periods of voice activity and inactivity in the audio recording; segregating the audio recording into a sequence of separated utterances or words; streaming the sequence of separated utterances or words to an ASR server; receiving, for each streamed utterance or word, an associated ASR decoding; receiving, for each ASR decoding, an associated confidence score indicative of a probability that the associated ASR decoding is correct; identifying for redaction any ASR decoding that contains sensitive information and any ASR decoding with a confidence score less than a redaction threshold; and preparing a redacted audio recording by eliminating or masking those word(s) and/or utterance(s) for which an associated ASR decoding was identified for redaction.
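The final two steps of the abstract (identifying decodings for redaction, then masking the corresponding audio) can be illustrated with a minimal Python sketch. It is not the patented implementation. The `Decoding` record, the digit-based `SENSITIVE` pattern, the 0.8 default threshold, and masking by zeroing samples are all assumptions made for illustration; transcoding, voice activity detection, and the ASR server itself are out of scope here.

```python
import re
from dataclasses import dataclass
from typing import List, Sequence

# Assumption: any digit in the decoded text counts as sensitive information.
SENSITIVE = re.compile(r"\d")


@dataclass
class Decoding:
    """Hypothetical record for one ASR result tied to a span of audio samples."""
    text: str          # ASR decoding of the utterance or word
    confidence: float  # probability that the decoding is correct
    start: int         # first sample index (inclusive)
    end: int           # last sample index (exclusive)


def needs_redaction(d: Decoding, threshold: float) -> bool:
    """Redact if the text looks sensitive OR the ASR result is not trusted."""
    return bool(SENSITIVE.search(d.text)) or d.confidence < threshold


def redact(samples: Sequence[int], decodings: List[Decoding],
           threshold: float = 0.8) -> List[int]:
    """Return a copy of the audio with every flagged span masked by silence."""
    out = list(samples)
    for d in decodings:
        if needs_redaction(d, threshold):
            # Clamp the span so slice assignment never changes the length.
            end = max(0, min(d.end, len(out)))
            start = max(0, min(d.start, end))
            out[start:end] = [0] * (end - start)
    return out


# Usage: ten samples of non-silent audio split into four utterances.
samples = [1] * 10
decodings = [
    Decoding("my card is", 0.95, 0, 3),  # kept: not sensitive, confident
    Decoding("4111", 0.97, 3, 6),        # redacted: contains digits
    Decoding("uh thanks", 0.40, 6, 8),   # redacted: confidence below threshold
    Decoding("goodbye", 0.90, 8, 10),    # kept
]
redacted = redact(samples, decodings)
print(redacted)
```

The low-confidence branch is what distinguishes this approach from redacting on text alone: a span whose decoding cannot be trusted is masked even when the decoded text looks harmless, since a misrecognized utterance might still contain sensitive speech.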
Utility
17 Oct 2018
5 Oct 2021