International Business Machines Corporation
AUTOMATED GENERATION OF MACHINE LEARNING MODEL PIPELINE COMBINATIONS
Last updated:
Abstract:
A computer receives a dataset and a set of ML pipeline components to generate a preferred ensemble of Machine Learning (ML) pipelines. An Automated Learning (AutoML) tool is applied to generate a plurality of ML pipelines. A performance value is determined for each pipeline, and a set of candidate pipelines is identified based on the performance values. The candidate pipelines are combined into candidate ensembles. A database provides historic performance data for a plurality of historic ensembles applied to a plurality of historic datasets. A metamodel is trained to identify patterns within the historic performance data, and a applies the patterns to generate predicted ensemble performance values for the candidate ensembles. A preferred ensemble is selected based on the predicted performance value rankings.
Utility
17 Dec 2020
23 Jun 2022