International Business Machines Corporation
CLASSIFYING DIGITAL DOCUMENTS IN MULTI-DOCUMENT TRANSACTIONS BASED ON EMBEDDED DATES
Last updated:
Abstract:
A generator categorizes documents in one or more transactions into buckets, each identified by a separate category for an expected time window based on a separate relative age of each of the documents evaluated from one or more dates identified in the documents. The generator trains a document classifier with a model of the separate relative age of each of the documents as a temporal characteristic correlated with the respective category of a respective bucket of the buckets. The document classifier executes on a input documents to classify each of the input documents as a particular logical type identified by a particular category from among multiple logical types.
Status:
Application
Type:
Utility
Filling date:
12 Mar 2021
Issue date:
1 Jul 2021