International Business Machines Corporation
QUALITY ASSESSMENT OF MACHINE-LEARNING MODEL DATASET

Last updated:

Abstract:

One embodiment provides a method, including: obtaining a dataset for use in building a machine-learning model; assessing a quality of the dataset, wherein the quality is assessed in view of an effect of the dataset on a performance of the machine-learning model, wherein the assessing comprises scoring the dataset with respect to each of a plurality of attributes of the dataset; for each of the plurality of attributes having a low quality score, providing at least one recommendation for increasing the quality of the dataset with respect to the attribute having a low quality score; and for each of the plurality of attributes having a low quality score, providing an explanation explaining a cause of the low quality score for the attribute having a low quality score.

Status:
Application
Type:

Utility

Filling date:

28 Sep 2020

Issue date:

31 Mar 2022