one
MACHINE-LEARNING BASED DATA ENTRY DUPLICATION DETECTION AND MITIGATION AND METHODS THEREOF

Last updated:

Abstract:

Systems and methods of the present disclosure enable a processor to automatically detect duplicate data entries by receiving data entries associated with a user, where each data entry includes a value, a time, an entity identifier, and a location. Pairs of similar data entries are determined by matching the entity identifier and the location pairs data entries. Candidate duplicate data entries are determined based on a proximity in time between data entries of the similar data entries. For each candidate duplicate data entry, a feature vector is generated including the entity identifier, location, value and time, and each feature vector is submitted to a duplicate classification model to automatically determine duplicate data entries from the candidate duplicate data entries, the duplicate classification model being trained according to a historical dispute entries.

Status:
Application
Type:

Utility

Filling date:

31 Dec 2020

Issue date:

30 Jun 2022