International Business Machines Corporation
Using machine learning to determine electronic document similarity
Last updated:
Abstract:
Methods and systems for using machine learning to determine electronic document similarity include extracting entities and corresponding relationships from each of two electronic documents of a corpus of electronic documents based on word embedding, computing an entity distance between the extracted entities and a relationship distance between the extracted relationships based on knowledge graph embedding, combining the entity and relationship distances to generate a similarity score between the electronic documents, and implementing the similarity score to perform a task associated with the electronic documents.
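The combining step described in the abstract can be illustrated with a minimal sketch. The abstract names neither a distance metric nor a combination rule, so everything here is an assumption made for illustration: Euclidean distance, a symmetric average nearest-neighbor distance between sets, a weighted sum controlled by a hypothetical `entity_weight` parameter, and a `1 / (1 + d)` mapping from distance to similarity. The extraction step is omitted; the inputs stand in for knowledge graph embedding vectors of entities and relationships that have already been extracted.

```python
import math


def euclidean(u, v):
    """Euclidean distance between two equal-length vectors."""
    return math.sqrt(sum((a - b) ** 2 for a, b in zip(u, v)))


def set_distance(xs, ys):
    """Symmetric average nearest-neighbor distance between two sets of vectors.

    Assumed for illustration; the source does not specify how distances
    between groups of embeddings are aggregated.
    """
    if not xs or not ys:
        raise ValueError("both sets must contain at least one vector")
    forward = sum(min(euclidean(x, y) for y in ys) for x in xs) / len(xs)
    backward = sum(min(euclidean(y, x) for x in xs) for y in ys) / len(ys)
    return (forward + backward) / 2


def similarity_score(entities_a, entities_b, relations_a, relations_b,
                     entity_weight=0.5):
    """Combine entity and relationship distances into a score in (0, 1].

    entity_weight is a hypothetical tuning parameter, not taken from the source.
    """
    entity_distance = set_distance(entities_a, entities_b)
    relationship_distance = set_distance(relations_a, relations_b)
    combined = (entity_weight * entity_distance
                + (1 - entity_weight) * relationship_distance)
    # Map a non-negative distance to a similarity: zero distance gives 1.0.
    return 1.0 / (1.0 + combined)


# Toy embeddings standing in for two documents (made-up values).
doc_a_entities = [[0.0, 0.0], [1.0, 1.0]]
doc_a_relations = [[0.5, 0.5]]
doc_b_entities = [[0.0, 0.1], [1.0, 0.9]]
doc_b_relations = [[0.5, 0.6]]

score = similarity_score(doc_a_entities, doc_b_entities,
                         doc_a_relations, doc_b_relations)
```

Under these assumptions, two documents with identical entity and relationship embeddings score exactly 1.0, and the score falls toward 0 as either distance grows. A downstream task such as clustering or duplicate detection could then apply a threshold to the score.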
Status:
Grant
Type:
Utility
Filing date:
23 Oct 2018
Issue date:
1 Mar 2022