International Business Machines Corporation
Navigating unstructured documents using structured documents including information extracted from unstructured documents

Last updated:

Abstract:

Aspects of the present disclosure describe techniques for generating a machine learning model for extracting information from textual content. The method generally includes receiving an unstructured document and a structured document including information extracted from the unstructured document and position information associated with the extracted information. The unstructured document is rendered in a first pane, and a graphical rendering of the structured document is rendered in a second pane. The graphical rendering generally may be a structure in which content from the structured document is displayed in a hierarchical format. Each element in the structured document is linked to the rendered unstructured document based on position information included in the structured document.

Status:
Grant
Type:

Utility

Filling date:

7 Feb 2020

Issue date:

19 Jul 2022