International Business Machines Corporation
Document analysis technique for understanding information
Last updated:
Abstract:
A computer-implemented method, system and computer program product for understanding information using a document analysis technique. An initial corpus of information is formed by identifying a document(s) that match a search criteria. The initial corpus of information is expanded with a set of documents containing statements with a semantic meaning within a threshold degree of similarity to a semantic meaning of statements contained within the document(s) used to form the initial corpus of information. Viewpoint(s) are then extracted from the expanded corpus of information using a natural language processing technique. A new set of documents is analyzed by identifying the subject, assertion and context statements. Assertions in the new set of documents that are within a threshold degree of agreement or disagreement with the extracted viewpoint are highlighted to assist the user in understanding how information aligns with a viewpoint.
Utility
20 Aug 2019
14 Dec 2021