International Business Machines Corporation
CLASSIFYING DOCUMENTS BASED ON TEXT ANALYSIS AND MACHINE LEARNING
Last updated:
Abstract:
A computing device identifies a set of documents for classification. The computing device classifies documents of a first subset of the set of documents based, at least in part, on a text analysis of the documents of the first subset. The computing device trains a document classifier using, as training data: (i) results of the classifying of the documents of the first subset, and (ii) metadata associated with the documents of the first subset. The computing device classifies documents of a second subset of the set of documents by providing metadata of the documents of the second subset to the trained document classifier.
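The two-stage flow described in the abstract can be illustrated with a minimal sketch. The abstract does not name a text-analysis method, a classifier type, or any metadata fields, so everything below is an assumption made for illustration: the keyword lists, the `ext` and `folder` metadata keys, the sample documents, and the use of a small categorical naive Bayes model are all hypothetical stand-ins.

```python
import math
from collections import Counter, defaultdict

# Hypothetical keyword lists standing in for the "text analysis" step.
KEYWORDS = {
    "invoice": {"invoice", "payment", "amount"},
    "contract": {"agreement", "party", "term"},
}

def classify_by_text(text):
    """Label a document by counting keyword hits (assumed text analysis)."""
    words = set(text.lower().split())
    return max(KEYWORDS, key=lambda label: len(words & KEYWORDS[label]))

def train_metadata_classifier(metadata_rows, labels):
    """Count label and metadata-value frequencies (categorical naive Bayes)."""
    label_counts = Counter(labels)
    feature_counts = defaultdict(Counter)  # (label, key) -> value counts
    values_seen = defaultdict(set)         # key -> distinct values observed
    for meta, label in zip(metadata_rows, labels):
        for key, value in meta.items():
            feature_counts[(label, key)][value] += 1
            values_seen[key].add(value)
    return label_counts, feature_counts, values_seen

def classify_by_metadata(model, meta):
    """Pick the label with the highest smoothed log-probability."""
    label_counts, feature_counts, values_seen = model
    total = sum(label_counts.values())

    def score(label):
        s = math.log(label_counts[label] / total)
        for key, value in meta.items():
            count = feature_counts[(label, key)][value]
            # Add-one smoothing so unseen values do not zero out a label.
            denom = label_counts[label] + len(values_seen[key]) + 1
            s += math.log((count + 1) / denom)
        return s

    return max(label_counts, key=score)

# First subset: full text is analyzed (sample data is hypothetical).
first_subset = [
    {"text": "Invoice for payment of the amount due",
     "meta": {"ext": "pdf", "folder": "finance"}},
    {"text": "Payment amount listed on invoice",
     "meta": {"ext": "pdf", "folder": "finance"}},
    {"text": "Agreement between each party for a fixed term",
     "meta": {"ext": "docx", "folder": "legal"}},
    {"text": "The party signs the agreement",
     "meta": {"ext": "docx", "folder": "legal"}},
]
# Second subset: only metadata is provided to the trained classifier.
second_subset_meta = [
    {"ext": "pdf", "folder": "finance"},
    {"ext": "docx", "folder": "legal"},
]

# Step 1: classify the first subset by text analysis.
labels = [classify_by_text(doc["text"]) for doc in first_subset]
# Step 2: train on (metadata, text-derived label) pairs.
model = train_metadata_classifier([d["meta"] for d in first_subset], labels)
# Step 3: classify the second subset from metadata alone.
predictions = [classify_by_metadata(model, m) for m in second_subset_meta]
```

In this sketch the text-derived labels serve as training targets, so the documents in the second subset are classified without their text being read. Whether that matches the claimed implementation in detail cannot be determined from the abstract alone.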
Status:
Application
Type:
Utility
Filing date:
7 Oct 2020
Issue date:
7 Apr 2022