International Business Machines Corporation
AUTOMATED NON-NATIVE TABLE REPRESENTATION ANNOTATION FOR MACHINE-LEARNING MODELS

Abstract:

One embodiment provides a method, including: receiving two documents, one of the two documents having at least one table that includes the same information as a corresponding table in the other of the two documents, wherein (i) one of the two documents comprises the at least one table in an unstructured table representation and (ii) the other of the two documents comprises the at least one table in a structured table representation; identifying text elements within the at least one table in the unstructured table representation; matching the identified text elements with table elements within the at least one table in the structured table representation; and annotating the at least one table in the structured table representation based upon the matches between the table elements and text elements.
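The steps named in the abstract (identify text elements, match them to structured table elements, annotate the structured table) can be illustrated with a minimal sketch. Everything here is an assumption made for illustration and is not taken from the application: the `TextElement` and `AnnotatedCell` types, the bounding-box fields, the `annotate_table` helper, and in particular the matching rule, which is simple exact matching on normalized text rather than whatever matching the claimed method uses.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class TextElement:
    """A text fragment found in the unstructured table (hypothetical type)."""
    text: str
    bbox: tuple  # assumed (x0, y0, x1, y1) page coordinates


@dataclass
class AnnotatedCell:
    """A structured-table cell plus the annotation copied from its match."""
    row: int
    col: int
    text: str
    bbox: Optional[tuple] = None  # None when no text element matched


def normalize(text: str) -> str:
    # Lower-case and collapse whitespace so small formatting
    # differences do not block a match.
    return " ".join(text.lower().split())


def annotate_table(structured_rows, text_elements):
    # Group text elements by normalized text; each element is consumed
    # at most once so repeated values map to distinct cells.
    pool = {}
    for element in text_elements:
        pool.setdefault(normalize(element.text), []).append(element)

    annotated = []
    for r, row in enumerate(structured_rows):
        for c, cell_text in enumerate(row):
            candidates = pool.get(normalize(cell_text), [])
            match = candidates.pop(0) if candidates else None
            bbox = match.bbox if match else None
            annotated.append(AnnotatedCell(r, c, cell_text, bbox))
    return annotated


# Structured representation: rows of cell strings (e.g. parsed from markup).
structured = [["Region", "Revenue"], ["EMEA", "120"]]

# Unstructured representation: text fragments with positions (e.g. from a page image).
elements = [
    TextElement("Region", (10, 10, 60, 20)),
    TextElement("Revenue", (70, 10, 130, 20)),
    TextElement("EMEA", (10, 30, 50, 40)),
    TextElement(" 120 ", (70, 30, 95, 40)),
]

cells = annotate_table(structured, elements)
```

In this sketch the annotation is just the position of the matched text element, attached to the corresponding structured cell; cells with no matching text element keep `bbox=None` so they can be reviewed separately.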

Status:

Application

Type:

Utility

Filing date:

14 Apr 2020

Issue date:

14 Oct 2021