International Business Machines Corporation
AUTOMATED GENERATION OF MACHINE LEARNING MODEL PIPELINE COMBINATIONS

Last updated:

Abstract:

A computer receives a dataset and a set of ML pipeline components to generate a preferred ensemble of Machine Learning (ML) pipelines. An Automated Learning (AutoML) tool is applied to generate a plurality of ML pipelines. A performance value is determined for each pipeline, and a set of candidate pipelines is identified based on the performance values. The candidate pipelines are combined into candidate ensembles. A database provides historic performance data for a plurality of historic ensembles applied to a plurality of historic datasets. A metamodel is trained to identify patterns within the historic performance data, and a applies the patterns to generate predicted ensemble performance values for the candidate ensembles. A preferred ensemble is selected based on the predicted performance value rankings.

Status:
Application
Type:

Utility

Filling date:

17 Dec 2020

Issue date:

23 Jun 2022