International Business Machines Corporation
Suspect duplicate processing through a feedback-driven learning process

Abstract:

Methods and apparatus, including computer program products, implementing and using techniques for processing suspect duplicate records in a master data management system. A master data management module identifies two or more suspect duplicate records in the master data management system based on scores. A matching engine classifies the two or more suspect duplicate records, by comparing the scores against threshold values, into one of: a match, a non-match, or a possible match. The master data management module re-classifies the suspect duplicate records and adjusts the threshold values of the matching engine for classification of future records, in response to receiving, by a data stewardship client, a user input indicating an incorrect classification of the suspect duplicate records.
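
The classification and feedback loop described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the class name `MatchingEngine`, the two-threshold layout (`lower`, `upper`), the `margin` parameter, and the policy of moving a threshold just far enough to agree with the steward's correction are all assumptions made for illustration. The abstract states only that scores are compared against threshold values and that those values are adjusted after a user reports an incorrect classification.

```python
from dataclasses import dataclass

MATCH = "match"
POSSIBLE_MATCH = "possible match"
NON_MATCH = "non-match"


@dataclass
class MatchingEngine:
    """Hypothetical two-threshold classifier for suspect duplicate scores."""
    lower: float          # scores below this are non-matches
    upper: float          # scores at or above this are matches
    margin: float = 0.01  # assumed smallest gap used when moving a threshold

    def classify(self, score: float) -> str:
        # Compare the score against the two thresholds.
        if score >= self.upper:
            return MATCH
        if score < self.lower:
            return NON_MATCH
        return POSSIBLE_MATCH

    def apply_feedback(self, score: float, correct_label: str) -> str:
        """Adjust thresholds so that `score` is classified as `correct_label`.

        Assumed policy: move a threshold only as far as needed, and keep
        lower <= upper at all times.
        """
        if correct_label == MATCH:
            self.upper = min(self.upper, score)
            self.lower = min(self.lower, self.upper)
        elif correct_label == NON_MATCH:
            self.lower = max(self.lower, score + self.margin)
            self.upper = max(self.upper, self.lower)
        elif correct_label == POSSIBLE_MATCH:
            self.lower = min(self.lower, score)
            self.upper = max(self.upper, score + self.margin)
        else:
            raise ValueError(f"unknown label: {correct_label!r}")
        # Re-classify the corrected pair with the updated thresholds.
        return self.classify(score)


engine = MatchingEngine(lower=5.0, upper=10.0)
first = engine.classify(7.0)                # falls between the thresholds
corrected = engine.apply_feedback(7.0, MATCH)
later = engine.classify(8.0)                # a future record benefits from the feedback
```

In this sketch a score of 7.0 is first classified as a possible match. After the steward marks it as a match, the upper threshold drops to 7.0, so a later pair scoring 8.0 is classified as a match without manual review. A production system would likely adjust thresholds more conservatively, for example only after several consistent corrections.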

Status:
Grant
Type:

Utility

Filing date:

11 Jun 2018

Issue date:

17 May 2022