Oracle Corporation
SYSTEMS AND METHODS FOR OPTIMIZING MACHINE LEARNING MODELS BY SUMMARIZING LIST CHARACTERISTICS BASED ON MULTI-DIMENSIONAL FEATURE VECTORS

Abstract:

Techniques for summarizing lists for machine learning operations are disclosed. In some embodiments, a machine learning system generates feature vectors for a set of items based on varying values among a set of feature attributes. The system further generates, based on the feature vectors, a set of clusters, and generates a summary vector for a list of items as a function of the distribution of the items within the set of clusters, where the summary vector has a length equal to the number of clusters in the set. Summary vectors may be generated for a plurality of examples within a training dataset. The system may use the summary vectors to train a machine learning model to estimate unknown labels for new examples.
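
The pipeline described in the abstract can be illustrated with a minimal sketch. It is not the claimed implementation: the abstract does not name a clustering algorithm or the exact function of the distribution, so this sketch assumes plain k-means (seeded with the first k vectors) and a summary vector holding the fraction of a list's items in each cluster. The names `assign`, `kmeans`, and `summary_vector` are hypothetical.

```python
from math import dist  # Euclidean distance, Python 3.8+


def assign(vec, centroids):
    """Return the index of the centroid nearest to vec."""
    return min(range(len(centroids)), key=lambda i: dist(vec, centroids[i]))


def kmeans(vectors, k, iters=20):
    """Tiny k-means. Assumption: centroids are seeded with the first k vectors."""
    centroids = [list(v) for v in vectors[:k]]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in vectors:
            groups[assign(v, centroids)].append(v)
        for i, group in enumerate(groups):
            if group:  # an empty cluster keeps its previous centroid
                centroids[i] = [sum(col) / len(group) for col in zip(*group)]
    return centroids


def summary_vector(item_list, centroids):
    """Fraction of the list's items in each cluster; length equals the cluster count.

    Assumption: a normalized histogram stands in for "a function of the
    distribution of the items within the set of clusters".
    """
    counts = [0] * len(centroids)
    for v in item_list:
        counts[assign(v, centroids)] += 1
    total = len(item_list)
    return [c / total if total else 0.0 for c in counts]


# Hypothetical feature vectors for four items, two feature attributes each.
item_vectors = [(0.0, 0.0), (10.0, 10.0), (0.0, 1.0), (10.0, 11.0)]
centroids = kmeans(item_vectors, k=2)

# One list of items (for example, one training example) summarized as a
# fixed-length vector, regardless of how many items the list contains.
example_list = [(0.0, 0.0), (0.0, 1.0), (10.0, 10.0)]
summary = summary_vector(example_list, centroids)
```

Because every list maps to a vector of the same length, lists of different sizes can be fed to a model that expects fixed-width input; the summary vectors for labeled examples would then serve as training features.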

Status:

Application

Type:

Utility

Filing date:

29 Jul 2019

Issue date:

4 Feb 2021