Infosys Limited
Automated system for development and deployment of heterogeneous predictive models
Last updated:
Abstract:
A method and/or system for heterogeneous predictive models generation based on sampling of big data is disclosed. The method involves receiving a dataset and a target column associated with the dataset at a data processing engine from a distributed data warehouse. One or more columns associated with the dataset are classified at the data processing engine as a categorical column or a continuous column. One or more parameters in the dataset are identified to extract a sample data from the dataset. The sample data from the dataset is extracted based on the identified one or more parameters. One or more rank ordered machine learning algorithms are recommended to one or more users, to generate one or more predictive models from the sample data. One or more heterogeneous predictive models are generated based on the rank ordered algorithm through one or more iterations.
Utility
29 Mar 2017
17 Nov 2020