one
DATA LABELING FOR SYNTHETIC DATA GENERATION

Last updated:

Abstract:

Aspects described herein may relate to methods, systems, and apparatuses for labeling data in connection with synthetic data generation. The data labeling may begin with a manual process where a user provides labels for data. Based on the labels provided by the user, modified data may be generated and may include one or more encodings associated with the labels provided by the user. A machine-learning model may be trained to predict labels based on the modified data samples. Accuracy of the model may be determined based on comparing the predicted labels to further labels provided by the user and/or by allowing the user to indicate whether predicted labels are correct or incorrect. Once the model is determined to be accurate, the predicted labels may be used as a basis for generating synthetic data.

Status:
Application
Type:

Utility

Filling date:

3 Mar 2021

Issue date:

8 Sep 2022