Adobe Inc.
MACHINE-LEARNING TECHNIQUES FOR EVALUATING SUITABILITY OF CANDIDATE DATASETS FOR TARGET APPLICATIONS

Last updated:

Abstract:

Techniques disclosed herein relate generally to evaluating and selecting candidate datasets for use by software applications, such as selecting candidate datasets for training machine-learning models used in software applications. Various machine-learning and other data science techniques are used to identify unique entities in a candidate dataset that are likely to be part of target entities for a software application. A merit attribute is then determined for the candidate dataset based on the number of unique entities that are likely to be part of the target entities, and weights associated with these unique entities. The merit attribute is used to identify the most efficient or most cost-effective candidate dataset for the software application.

Status:
Application
Type:

Utility

Filling date:

13 Feb 2019

Issue date:

13 Aug 2020