International Business Machines Corporation
Using machine learning to determine electronic document similarity

Last updated:

Abstract:

Methods and systems for using machine learning to determine electronic document similarity include extracting entities and corresponding relationships from each of two electronic documents of a corpus of electronic documents based on word embedding, computing an entity distance between the extracted entities and a relationship distance between the extracted relationships based on knowledge graph embedding, combining the entity and relationship distances to generate a similarity score between the electronic documents, and implementing the similarity score to perform a task associated with the electronic documents.

Status:
Grant
Type:

Utility

Filling date:

23 Oct 2018

Issue date:

1 Mar 2022