International Business Machines Corporation
CLASSIFYING DOCUMENTS BASED ON TEXT ANALYSIS AND MACHINE LEARNING

Last updated:

Abstract:

A computer device identifies a set of documents for classification. The computing device classifies documents of a first subset of the set of documents based, at least in part, on a text analysis of the documents of the first subset. The computing device trains a document classifier using, as training data: (i) results of the classifying of the documents of the first subset, and (ii) metadata associated with the documents of the first subset. The computing device classifies documents of a second subset of the set of documents by providing metadata of the documents of the second subset to the trained document classifier.

Status:
Application
Type:

Utility

Filling date:

7 Oct 2020

Issue date:

7 Apr 2022