International Business Machines Corporation
Extracting attributes from embedded table structures
Last updated:
Abstract:
Embodiments include methods, systems, and computer program products for extracting attributes from embedded table structures in a document. Aspects include identifying a table in the document and identifying one or more headers of the table by locating co-occurring attributes in the table. Aspects also include identifying a plurality of values in the table and creating an annotation for each of the plurality of values in the table, wherein each annotation includes text extracted from the one or more headers that correspond to the location of the value in the table.
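The annotation step described in the abstract can be illustrated with a minimal Python sketch. It is not the patented method. It assumes the table has already been located and parsed into a list of rows, and it simply treats the first row and first column as the headers rather than detecting them through co-occurring attributes. The names `Annotation` and `annotate_table`, and the sample data, are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class Annotation:
    """A table value paired with the header text for its position."""
    value: str
    row_header: str
    column_header: str


def annotate_table(rows):
    """Create one annotation per non-empty value cell.

    Assumption (not from the source): rows[0] holds the column headers
    and the first cell of every later row holds that row's header.
    """
    column_headers = rows[0][1:]
    annotations = []
    for row in rows[1:]:
        row_header, cells = row[0], row[1:]
        # Pair each value with the column header at the same position.
        for column_header, cell in zip(column_headers, cells):
            if cell.strip():
                annotations.append(Annotation(cell, row_header, column_header))
    return annotations


# Hypothetical sample table, already parsed into rows of strings.
table = [
    ["Test", "Result", "Units"],
    ["Glucose", "95", "mg/dL"],
    ["Sodium", "140", "mmol/L"],
]

for annotation in annotate_table(table):
    print(annotation)
```

Each resulting annotation carries the value together with the header text for its row and column, which mirrors the abstract's requirement that an annotation include header text corresponding to the value's location.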
Status:
Grant
Type:
Utility
Filing date:
9 Sep 2019
Issue date:
8 Mar 2022