International Business Machines Corporation
Predicting keywords in an application
Last updated:
Abstract:
A computer-implemented method includes training a classification model to produce a mapping from a set of feature vectors to a set of confidence levels. Each feature vector describes a respective term, and each confidence level indicates a likelihood that the respective term is a keyword. A plurality of program artifacts are tokenized into a plurality of terms. For each term in the plurality of terms, a respective feature vector is determined for describing the term, based on the plurality of program artifacts. For each term in the plurality of terms, based on the respective feature vector, using the mapping, a respective confidence level is determined for indicating a likelihood that the term is a keyword of the plurality of program artifacts.
Utility
25 May 2018
27 Jul 2021