International Business Machines Corporation
Interactive Structure Annotation with Artificial Intelligence
Last updated:
Abstract:
A computer system, product, and method are provided to utilize machine learning to facilitate document processing. A document collection is introduced to an artificial neural network (ANN), which subjects the document collection to table region identification within discretized contiguous areas. The documents are assigned to one or more clusters responsive to the leveraged ANN. Documents are selectively evaluated from the clusters, and one or more label corrections are applied to the ANN. The ANN generates an updated document collection incorporating the applied one or more label corrections.
Status:
Application
Type:
Utility
Filling date:
13 Nov 2020
Issue date:
19 May 2022