one
MAINTAINING A DATASET BASED ON PERIODIC CLEANSING OF RAW SOURCE DATA

Last updated:

Abstract:

In some implementations, a data cleaning platform may determine a respective entity key for each data record in a cleansed dataset based on a combination of fields, in each data record, that contain information that uniquely identifies an entity associated with a respective data record. The data cleaning platform may generate a delta dataset based on a set of uncleansed data records related to transactions that occurred after a time when the cleansed dataset was first generated. For example, in some implementations, each uncleansed data record in the delta dataset may be associated with a corresponding entity key based on the combination of fields. The data cleaning platform may perform a data join to update the cleansed dataset to include data records related to the transactions that occurred after the time when the cleansed dataset was first generated.

Status:
Application
Type:

Utility

Filling date:

1 Feb 2021

Issue date:

4 Aug 2022