International Business Machines Corporation
CLASSIFYING DIGITAL DOCUMENTS IN MULTI-DOCUMENT TRANSACTIONS BASED ON EMBEDDED DATES

Last updated:

Abstract:

A generator categorizes documents in one or more transactions into buckets, each identified by a separate category for an expected time window based on a separate relative age of each of the documents evaluated from one or more dates identified in the documents. The generator trains a document classifier with a model of the separate relative age of each of the documents as a temporal characteristic correlated with the respective category of a respective bucket of the buckets. The document classifier executes on a input documents to classify each of the input documents as a particular logical type identified by a particular category from among multiple logical types.

Status:
Application
Type:

Utility

Filling date:

12 Mar 2021

Issue date:

1 Jul 2021