International Business Machines Corporation
SIGNALING CONCEPT DRIFT DURING KNOWLEDGE BASE POPULATION

Last updated:

Abstract:

An approach is provided for signaling concept drift during knowledge base population. A knowledge graph and a collection of text is received, and a vector space is built. A sequence of data items associated with a type of entity or a relation is received. Entities or relations from the knowledge graph are embedded into the vector space to generate entity or relation vectors. Data items associated with the type of entity or the relation are embedded into the vector space to generate data item vectors. An emerging entity or relation concept vector is computed by determining a centroid of the data item vectors. An entity or relation concept vector is computed by determining a centroid of the entity or relation vectors. A signal is generated when a distance between the emerging entity or relation concept vector and the entity or relation concept vector is greater than a threshold.

Status:
Application
Type:

Utility

Filling date:

4 Dec 2019

Issue date:

10 Jun 2021