International Business Machines Corporation
DEEP DATA CLASSIFICATION USING GOVERNANCE AND MACHINE LEARNING
Last updated:
Abstract:
A method of data classification includes: identifying a cluster of data classes; classifying columns of a current data set; identifying the cluster in the current data set; determining, based on the cluster, an expected column is missing from the current data set; determining a neighboring data set; identifying the expected column in the neighboring data set; classifying the expected column in the neighboring data set; creating a new data class in the current data set; and classifying an unclassified column in the current data set or the neighboring data set with the new data class.
Status:
Application
Type:
Utility
Filling date:
19 Mar 2020
Issue date:
23 Sep 2021