NICE Ltd.
SYSTEMS AND METHODS FOR PRODUCING A SEMANTIC REPRESENTATION OF A DOCUMENT
Last updated:
Abstract:
A system and method for determining an embedding for a document (e.g. representing the document in vector space) by determining for the document a preliminary document embedding; determining for the document a document topic embedding based on a set of nearest topics to the preliminary document embedding; determining for each phrase in the document a topic relevancy score based on the document topic embedding and the embedding associated with the phrase; using a ranking algorithm to determine a saliency score for each phrase in the document, each saliency score based on the topic relevancy score for the phrase, and an inverse frequency score for the phrase; and calculating an embedding for the document using the saliency scores and embedding, for the phrases in the document.
Utility
18 Feb 2021
18 Aug 2022