International Business Machines Corporation
Embedding natural language context in structured documents using document anatomy

Abstract:

Methods, systems and computer program products for natural language context embedding are provided herein. A computer-implemented method includes extracting a document anatomy and document elements from a given structured document, identifying semantic references in the given structured document, and generating an ontology comprising (i) a hierarchy of concepts and (ii) relations connecting the concepts, each concept comprising attributes for a document element. The computer-implemented method also includes generating natural language text context for a given document element by utilizing the ontology to combine (i) attributes of a given concept corresponding to the given document element with (ii) attributes of another concept, the other concept corresponding to another document element, the other concept being connected to the given concept by at least one relation. The computer-implemented method further includes modifying the given structured document by embedding the natural language context with the given document element in the given structured document.
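The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the toy document layout, the `Concept` class, the `part_of` relation label, and the sentence template are all assumptions made for illustration, and the semantic-reference identification step is omitted.

```python
from dataclasses import dataclass, field


@dataclass
class Concept:
    """One ontology concept holding the attributes of a document element."""
    name: str
    attributes: dict
    # Each relation is a (label, name of the other concept) pair.
    relations: list = field(default_factory=list)


def build_ontology(document):
    """Build a two-level concept hierarchy (table -> cell) from a toy document.

    The dictionary layout of `document` is hypothetical.
    """
    ontology = {}
    for table in document["tables"]:
        ontology[table["id"]] = Concept(table["id"], {"caption": table["caption"]})
        for cell in table["cells"]:
            concept = Concept(
                cell["id"],
                {"row": cell["row"], "column": cell["column"], "value": cell["value"]},
            )
            # Connect the cell concept to its parent table concept.
            concept.relations.append(("part_of", table["id"]))
            ontology[cell["id"]] = concept
    return ontology


def generate_context(ontology, element_id):
    """Combine a concept's attributes with those of its related concepts."""
    concept = ontology[element_id]
    attrs = concept.attributes
    parts = [f"{attrs['column']} for {attrs['row']} is {attrs['value']}"]
    for label, other_name in concept.relations:
        other = ontology[other_name]
        if label == "part_of":
            parts.append(f"in the table '{other.attributes['caption']}'")
    return " ".join(parts) + "."


def embed_context(document, ontology):
    """Modify the document by storing the generated text with each cell."""
    for table in document["tables"]:
        for cell in table["cells"]:
            cell["context"] = generate_context(ontology, cell["id"])
    return document


document = {
    "tables": [
        {
            "id": "table-1",
            "caption": "Quarterly revenue",
            "cells": [
                {"id": "cell-1", "row": "Q1", "column": "Revenue", "value": "10M"},
            ],
        }
    ]
}

ontology = build_ontology(document)
enriched = embed_context(document, ontology)
print(enriched["tables"][0]["cells"][0]["context"])
```

In this sketch a cell that reads only "10M" in isolation gains a sentence naming its row, its column, and the caption of the table it belongs to, which is the kind of context the abstract describes embedding alongside a document element.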

Status:

Grant

Type:

Utility

Filing date:

3 Dec 2018

Issue date:

19 Oct 2021