Microsoft Corporation
Using lineage to infer data quality issues
Last updated:
Abstract:
Identifying data quality along a data flow. A method includes identifying quality metadata for two or more datasets. The quality metadata defines one or more of quality of a data source, accuracy of a dataset, completeness of a dataset, freshness of a dataset, or relevance of a dataset. At least some of the metadata is based on results of operations along a data flow. Based on the metadata, the method includes creating one or more quality indexes for the datasets. The one or more quality indexes include a characterization of quality of two or more datasets.
Status:
Grant
Type:
Utility
Filling date:
4 Nov 2020
Issue date:
23 Aug 2022