International Business Machines Corporation
AUTOMATED DATA LABELING

Abstract:

Systems and methods for computer-automated labeling of data are disclosed. In embodiments, a method includes: identifying technical data assets in lineage data and corresponding business items in User Interface (UI) data of a user, wherein the lineage data includes a data source for the UI data; mapping the technical data assets to the corresponding business items; determining relevant labels to assign to the technical data assets from a label repository based on a similarity analysis of names of the corresponding business items and labels in the label repository; determining that one or more of the relevant labels meet a confidence threshold based on the similarity analysis; and automatically assigning the one or more of the relevant labels to associated ones of the technical data assets based on the determining that the one or more of the relevant labels meet the confidence threshold.
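The abstract's final three steps (similarity analysis, confidence threshold, automatic assignment) can be illustrated with a minimal sketch. This is not the method claimed in the application. It assumes the mapping from technical data assets to business items has already been built, uses Python's standard `difflib.SequenceMatcher` as a stand-in for the unspecified similarity analysis, and treats the similarity ratio itself as the confidence score. The names `similarity`, `assign_labels`, the 0.8 threshold, and the sample asset and label names are all hypothetical.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Case-insensitive string similarity in [0, 1].

    Assumption: a simple character-based ratio stands in for the
    similarity analysis, which the abstract does not specify.
    """
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def assign_labels(asset_to_item, label_repository, threshold=0.8):
    """Assign repository labels to technical assets.

    asset_to_item: dict mapping a technical asset name to the name of
        its corresponding business item (hypothetical, pre-built mapping).
    label_repository: iterable of candidate label names.
    threshold: hypothetical confidence cutoff; labels scoring below it
        are not assigned automatically.
    """
    assignments = {}
    for asset, item_name in asset_to_item.items():
        # Score every repository label against the business item name.
        scored = [(label, similarity(item_name, label)) for label in label_repository]
        # Keep only labels that meet the confidence threshold.
        accepted = [label for label, score in scored if score >= threshold]
        if accepted:
            assignments[asset] = accepted
    return assignments


# Hypothetical sample data: column names mapped to UI field names.
mapping = {"CUST_EML_ADDR": "Customer Email", "TXN_AMT": "Transaction Amount"}
labels = ["Customer Email", "Transaction Amount", "Postal Code"]
result = assign_labels(mapping, labels)
print(result)
```

The sketch matches on the business item name rather than the technical asset name, which mirrors the abstract: cryptic column names such as `CUST_EML_ADDR` are labeled through the readable UI names they feed.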

Status:

Application

Type:

Utility

Filing date:

13 Jan 2020

Issue date:

15 Jul 2021