International Business Machines Corporation
DEEP DATA CLASSIFICATION USING GOVERNANCE AND MACHINE LEARNING

Last updated:

Abstract:

A method of data classification includes: identifying a cluster of data classes; classifying columns of a current data set; identifying the cluster in the current data set; determining, based on the cluster, an expected column is missing from the current data set; determining a neighboring data set; identifying the expected column in the neighboring data set; classifying the expected column in the neighboring data set; creating a new data class in the current data set; and classifying an unclassified column in the current data set or the neighboring data set with the new data class.

Status:
Application
Type:

Utility

Filling date:

19 Mar 2020

Issue date:

23 Sep 2021