Adobe Inc.
EXTRACTING DEFINITIONS FROM DOCUMENTS UTILIZING DEFINITION-LABELING-DEPENDENT MACHINE LEARNING BACKGROUND

Abstract:

This disclosure describes methods, non-transitory computer-readable storage media, and systems that extract a definition for a term from a source document by utilizing a single machine-learning framework to classify a word sequence from the source document as including a term definition and to label words from the word sequence. To illustrate, the disclosed systems can receive a source document including a word sequence arranged in one or more sentences. The disclosed systems can utilize a machine-learning model to classify the word sequence as comprising a definition for a term and to generate labels for the words from the word sequence corresponding to the term and the definition. Based on the classification of the word sequence and the generated labels, the disclosed systems can extract the definition for the term from the source document.
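The final extraction step described in the abstract can be illustrated with a minimal Python sketch. It is not the disclosed implementation: the machine-learning model is replaced by hand-written outputs, and the function name `extract_definition`, the BIO-style label names (`B-TERM`, `I-TERM`, `B-DEF`, `I-DEF`, `O`), and the example sentence are all assumptions introduced here for illustration.

```python
from typing import List, Optional, Tuple


def extract_definition(
    words: List[str],
    labels: List[str],
    is_definition: bool,
) -> Optional[Tuple[str, str]]:
    """Combine a sequence-level classification with per-word labels.

    Returns (term, definition) only when the word sequence was classified
    as containing a definition and both label types are present.
    """
    if len(words) != len(labels):
        raise ValueError("words and labels must have the same length")
    # Sequence-level decision: skip sequences not classified as definitions.
    if not is_definition:
        return None
    # Word-level decision: collect words by their (hypothetical) label suffix.
    term = " ".join(w for w, lab in zip(words, labels) if lab.endswith("TERM"))
    definition = " ".join(w for w, lab in zip(words, labels) if lab.endswith("DEF"))
    if not term or not definition:
        return None
    return term, definition


# Hand-written stand-ins for what a trained model might output (assumed data).
words = "A neuron is a cell that transmits nerve impulses".split()
labels = ["O", "B-TERM", "O", "B-DEF", "I-DEF", "I-DEF", "I-DEF", "I-DEF", "I-DEF"]

result = extract_definition(words, labels, is_definition=True)
print(result)
```

In this sketch the classification acts as a gate on the labels, which is one simple way to make extraction depend on both outputs. How the disclosed framework actually couples the two predictions is not specified in the abstract.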

Status:

Application

Type:

Utility

Filing date:

11 Aug 2020

Issue date:

17 Feb 2022