Microsoft Corporation
SYNTHETIC DATA GENERATION FOR TRAINING OF NATURAL LANGUAGE UNDERSTANDING MODELS

Last updated:

Abstract:

This document relates to machine learning. One example includes a method or technique that can be performed on a computing device. The method or technique can include obtaining a task-adapted generative model that has been tuned using one or more task-specific seed examples. The method or technique can also include inputting dialog acts into the task-adapted generative model and obtaining synthetic utterances that are output by the task-adapted generative model. The method or technique can also include populating a synthetic training corpus with synthetic training examples that include the synthetic utterances. The synthetic training corpus may be suitable for training a natural language understanding model.

Status:
Application
Type:

Utility

Filling date:

15 Sep 2020

Issue date:

17 Mar 2022