International Business Machines Corporation
SIGNALING CONCEPT DRIFT DURING KNOWLEDGE BASE POPULATION
Abstract:
An approach is provided for signaling concept drift during knowledge base population. A knowledge graph and a collection of text are received, and a vector space is built. A sequence of data items associated with a type of entity or a relation is received. Entities or relations from the knowledge graph are embedded into the vector space to generate entity or relation vectors. Data items associated with the type of entity or the relation are embedded into the vector space to generate data item vectors. An emerging entity or relation concept vector is computed by determining a centroid of the data item vectors. An entity or relation concept vector is computed by determining a centroid of the entity or relation vectors. A signal is generated when a distance between the emerging entity or relation concept vector and the entity or relation concept vector is greater than a threshold.
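The centroid comparison described in the abstract can be illustrated with a minimal sketch. It assumes the embedding step has already produced the vectors, and it uses Euclidean distance, which the abstract does not specify. The function names (centroid, euclidean_distance, drift_signal) and the sample values are hypothetical and are not taken from the patent.

```python
import math
from typing import List, Sequence

Vector = Sequence[float]


def centroid(vectors: Sequence[Vector]) -> List[float]:
    """Return the component-wise mean of a non-empty set of equal-length vectors."""
    if not vectors:
        raise ValueError("cannot compute the centroid of an empty set")
    count = len(vectors)
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / count for i in range(dim)]


def euclidean_distance(a: Vector, b: Vector) -> float:
    """Euclidean distance; an assumed metric, the abstract only says 'a distance'."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


def drift_signal(
    kg_vectors: Sequence[Vector],
    data_item_vectors: Sequence[Vector],
    threshold: float,
) -> bool:
    """Return True when the emerging concept lies farther than `threshold`
    from the concept derived from the knowledge graph."""
    concept = centroid(kg_vectors)          # entity or relation concept vector
    emerging = centroid(data_item_vectors)  # emerging concept vector
    return euclidean_distance(concept, emerging) > threshold


# Hypothetical, already-embedded vectors in a 2-dimensional space.
kg_vectors = [[0.0, 0.0], [2.0, 0.0]]         # centroid is [1.0, 0.0]
data_item_vectors = [[1.0, 3.0], [1.0, 5.0]]  # centroid is [1.0, 4.0]

print(drift_signal(kg_vectors, data_item_vectors, threshold=3.0))  # True: distance 4.0 > 3.0
print(drift_signal(kg_vectors, data_item_vectors, threshold=5.0))  # False: distance 4.0 <= 5.0
```

In this sketch the threshold is a free parameter supplied by the caller. Another metric, such as cosine distance, could replace euclidean_distance without changing the rest of the flow.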
Utility
4 Dec 2019
10 Jun 2021