International Business Machines Corporation
AUTOMATED MACHINE-LEARNING DATASET PREPARATION

Last updated:

Abstract:

A method of preparing a dataset may comprise calculating a pattern relevance for a first field in the dataset and modifying the first field based on the pattern relevance. The method may further comprise detecting a contextual cue in the first field. The method may further comprise retrieving contextual information for a value in the first field and adding that contextual information to the database. Finally, the method may further comprise identifying a numerical scheme for the first field and parsing the first field into a number according to that numerical scheme.

Status:
Application
Type:

Utility

Filling date:

4 May 2020

Issue date:

4 Nov 2021