International Business Machines Corporation
GENERATING QUANTITATIVELY ASSESSED SYNTHETIC TRAINING DATA
Last updated:
Abstract:
In an approach to generating quantitatively assessed synthetic training data, one or more computer processors identify an initial plurality of clusters in a dataset utilizing a trained classification model and a plurality of associated hyperparameters, wherein the clusters have sufficient density to be represented in a calculated probability distribution. The one or more computer processors generate one or more synthetic data points for each identified cluster utilizing a corresponding calculated probability distribution. The one or more computer processors quantitatively assess the one or more generated synthetic data points.
Status:
Application
Type:
Utility
Filling date:
19 Mar 2020
Issue date:
23 Sep 2021