one
Computer-based systems for performing a candidate phrase search in a text document and methods of use thereof

Last updated: 24 Aug 2022

Abstract:

A method for identifying phrases in a text document having a similar discourse to a candidate phrase includes separating text in a document file into a plurality of phrases and generating a plurality of embedding vectors in a textual embedding space by inputting the plurality of phrases into an embedding engine. A mapping of each embedding vector in the textual embedding space is generated with each corresponding phrase and a document location of each corresponding phrase in the document file. A candidate phrase is received by a user and a candidate embedding vector is generated using the embedding engine. Similarity scores are computed based on the plurality of embedding space distances between the candidate phrase embedding vector location and each respective location of each embedding vector in the textual embedding space. A listing of phrases with the highest similarity scores are outputted with respective document locations in the text.

Status:

Grant

Type:

Utility

Filling date:

12 Jun 2020

Issue date:

23 Aug 2022

Full patent description

Patent application document

one Computer-based systems for performing a candidate phrase search in a text document and methods of use thereof

Abstract:

one
Computer-based systems for performing a candidate phrase search in a text document and methods of use thereof