Open Text Corporation
METHOD AND SYSTEM FOR ASSESSING SIMILARITY OF DOCUMENTS

Last updated:

Abstract:

Systems and methods for assessing similarity of documents are provided. Embodiments of the systems and methods include extracting a reference document text from a reference document, extracting an archived document text from an archived document, and quantifying the reference document and the archived document. The systems and methods may also include determining a document similarity value of the quantified reference document and the archived document. Determining the document similarity value includes calculating a set of vector similarity values for a set of combinations of a reference document text vector and an archived document text vector, and calculating the document similarity value, including a sum of the plurality of vector similarity values.

Status:
Application
Type:

Utility

Filling date:

22 Nov 2019

Issue date:

19 Mar 2020