International Business Machines Corporation
Adversarial training data augmentation data for text classifiers

Last updated: 18 Aug 2021

Abstract:

An intelligent computer platform introduces adversarial training to natural language processing (NLP). An initial training set is modified with synthetic training data to create an adversarial training set. The modification uses natural language understanding (NLU) to parse the initial training set into components and to identify component categories. One or more paraphrase terms are identified for the components and component categories, and these function as replacement terms. The synthetic training data is, in effect, the initial training set merged with the replacement terms. When input is presented, a classifier uses the adversarial training set to identify the intent of the input and to output a classification label, from which accurate and reflective response data is generated.
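
The augmentation step described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the category lookup and the paraphrase lexicon below are hypothetical stand-ins for the NLU parser and paraphrase source, and the names `parse`, `augment`, `CATEGORIES`, and `PARAPHRASES` are assumptions made for illustration only.

```python
from itertools import product

# Hypothetical category lookup standing in for NLU parsing.
CATEGORIES = {"reset": "verb", "password": "noun"}

# Hypothetical paraphrase lexicon: (term, category) -> replacement terms.
# A real system would draw these from a paraphrase resource; they are
# hard-coded here purely for illustration.
PARAPHRASES = {
    ("reset", "verb"): ["restore", "change"],
    ("password", "noun"): ["passcode"],
}


def parse(utterance):
    """Split an utterance into (component, category) pairs."""
    return [(tok, CATEGORIES.get(tok, "other")) for tok in utterance.lower().split()]


def augment(training_set):
    """Return the initial examples plus synthetic paraphrased variants."""
    augmented = []
    seen = set()
    for text, label in training_set:
        components = parse(text)
        # Each slot offers the original term followed by its replacements.
        options = [[tok] + PARAPHRASES.get((tok, cat), []) for tok, cat in components]
        for combo in product(*options):
            candidate = " ".join(combo)
            if (candidate, label) not in seen:
                seen.add((candidate, label))
                augmented.append((candidate, label))
    return augmented


initial = [("reset my password", "account_access")]
adversarial_set = augment(initial)
```

Here every synthetic variant keeps the intent label of the example it was derived from, so a classifier trained on `adversarial_set` would see several phrasings of the same intent. Substituting at every slot grows the set multiplicatively, so a practical system would likely cap or filter the variants.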

Status:

Grant

Type:

Utility

Filing date:

15 Jan 2019

Issue date:

17 Aug 2021

Full patent description

Patent application document

Abstract:
