Microsoft Corporation
ADAPTIVE BATCHING TO REDUCE RECOGNITION LATENCY

Abstract:

Embodiments may include: collecting a first batch of acoustic feature frames of an audio signal, where the number of frames in the first batch equals a first batch size; inputting the first batch to a speech recognition network; in response to detecting a word hypothesis output by the speech recognition network, collecting a second batch of acoustic feature frames of the audio signal, where the number of frames in the second batch equals a second batch size greater than the first batch size; and inputting the second batch to the speech recognition network.
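
The batching scheme in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the function names (run_adaptive_batching, make_stub_recognizer), the default batch sizes, and the stub recognizer that stands in for the speech recognition network are all assumptions made for illustration. The sketch only shows the control flow: frames are grouped into small batches until the recognizer reports a word hypothesis, after which larger batches are used.

```python
from typing import Callable, Iterable, List, Optional, Sequence

Frame = Sequence[float]  # one acoustic feature frame (hypothetical representation)
Recognizer = Callable[[List[Frame]], Optional[str]]


def make_stub_recognizer(hit_on_call: int) -> Recognizer:
    """Return a stand-in recognizer that emits a word hypothesis on the
    hit_on_call-th batch it receives and None otherwise (illustration only)."""
    calls = 0

    def recognize(batch: List[Frame]) -> Optional[str]:
        nonlocal calls
        calls += 1
        return "hello" if calls == hit_on_call else None

    return recognize


def run_adaptive_batching(
    frames: Iterable[Frame],
    recognize: Recognizer,
    first_batch_size: int = 4,    # assumed value, not from the source
    second_batch_size: int = 16,  # assumed value, not from the source
) -> List[int]:
    """Feed frames to the recognizer in batches and return the batch sizes used.

    Batches start at first_batch_size. Once the recognizer outputs a word
    hypothesis, later batches use the larger second_batch_size.
    """
    if second_batch_size <= first_batch_size:
        raise ValueError("second_batch_size must exceed first_batch_size")

    batch_size = first_batch_size
    batch: List[Frame] = []
    sizes: List[int] = []

    for frame in frames:
        batch.append(frame)
        if len(batch) == batch_size:
            hypothesis = recognize(batch)
            sizes.append(len(batch))
            if hypothesis is not None:
                # A word hypothesis was detected: switch to the larger batch size.
                batch_size = second_batch_size
            batch = []

    if batch:
        # Flush any trailing partial batch at the end of the audio signal.
        recognize(batch)
        sizes.append(len(batch))

    return sizes
```

For example, with 40 frames and a stub recognizer that reports a hypothesis on its second batch, the sketch feeds two batches of 4 frames followed by two batches of 16. Small initial batches let the first hypothesis arrive sooner, while larger later batches reduce the number of network invocations.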

Status:

Application

Type:

Utility

Filing date:

27 Jan 2020

Issue date:

15 Jul 2021