Apple Inc.
Inverse text normalization for automatic speech recognition

Last updated: 23 Jul 2021

Abstract:

Techniques for inverse text normalization are provided. In some examples, speech input is received and a spoken-form text representation of the speech input is generated. The spoken-form text representation includes a token sequence. A feature representation is determined for the spoken-form text representation and a sequence of labels is determined based on the feature representation. The sequence of labels is assigned to the token sequence and specifies a plurality of edit operations to perform on the token sequence. Each edit operation of the plurality of edit operations corresponds to one of a plurality of predetermined types of edit operations. A written-form text representation of the speech input is generated by applying the plurality of edit operations to the token sequence in accordance with the sequence of labels. A task responsive to the speech input is performed using the generated written-form text representation.

Status:

Grant

Type:

Utility

Filling date:

29 Jun 2018

Issue date:

17 Mar 2020

Full patent description

Patent application document

Apple Inc. Inverse text normalization for automatic speech recognition

Abstract:

Apple Inc.
Inverse text normalization for automatic speech recognition