International Business Machines Corporation
Mixed initiative feature engineering
Last updated:
Abstract:
Computerized interactive feature visualization is carried out on a data set--a plurality of insight classes rank a plurality of features of the data set. Via a computerized user interface, user feedback is obtained based on the interactive feature visualization--a user selects and ranks a subset of the features. At least one transformation function is applied to at least one feature of the subset of features selected by the user, to automatically construct, with a computer, at least one additional feature for the data set. The data set with the at least one additional feature is a transformed data set. In some cases, a supervised task is carried out on the final data set; accuracy of a machine learning system implementing the at least one supervised task can be enhanced by the at least one additional feature, and/or a physical system can be controlled based on results of the at least one supervised task.
Utility
20 Feb 2019
2 Aug 2022