Meta Platforms, Inc.
Resolving entities from multiple data sources for assistant systems

Abstract:

In one embodiment, a method includes: accessing a number of records describing a number of entities, the records being generated from data collected from a number of data sources and grouped by data source; deduplicating the records in each group; selecting one data source as a core source; identifying, for a record in the core group, a candidate set of records from the non-core groups that satisfy conditions for inclusion in the candidate set for that record; generating a feature vector for each pair of records consisting of a record in the core group and a record in its candidate set; computing, for each pair of records, a probability that the pair describes a common entity; and linking the record in the candidate set to a globally unique entity identifier identifying a unique entity if the probability exceeds a threshold.
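
The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the record fields (`name`, `city`), the same-city candidate condition, the two features, the hand-picked logistic weights, and the 0.5 threshold are all assumptions chosen only to make the example concrete.

```python
import math
import uuid
from dataclasses import dataclass
from difflib import SequenceMatcher


@dataclass(frozen=True)
class Record:
    source: str  # data source the record came from
    name: str    # hypothetical field
    city: str    # hypothetical field


def dedupe(records):
    """Drop duplicates within one source group, keyed on normalized name and city."""
    seen, unique = set(), []
    for r in records:
        key = (r.name.strip().lower(), r.city.strip().lower())
        if key not in seen:
            seen.add(key)
            unique.append(r)
    return unique


def candidates(core_record, non_core_records):
    """Assumed candidate condition: the record is in the same city as the core record."""
    return [r for r in non_core_records if r.city.lower() == core_record.city.lower()]


def features(a, b):
    """Feature vector for a pair: name similarity ratio and a same-city indicator."""
    name_sim = SequenceMatcher(None, a.name.lower(), b.name.lower()).ratio()
    same_city = 1.0 if a.city.lower() == b.city.lower() else 0.0
    return [name_sim, same_city]


# Hand-picked weights standing in for a trained model (assumption).
WEIGHTS, BIAS = [8.0, 1.0], -6.0


def match_probability(vec):
    """Logistic score interpreted as the probability that a pair is one entity."""
    z = BIAS + sum(w * x for w, x in zip(WEIGHTS, vec))
    return 1.0 / (1.0 + math.exp(-z))


def resolve(records, core_source, threshold=0.5):
    """Return a mapping from each linked record to an entity identifier."""
    # Group by data source, then dedupe each group.
    groups = {}
    for r in records:
        groups.setdefault(r.source, []).append(r)
    groups = {s: dedupe(g) for s, g in groups.items()}

    core = groups[core_source]
    non_core = [r for s, g in groups.items() if s != core_source for r in g]

    links = {}
    for c in core:
        entity_id = str(uuid.uuid4())  # globally unique entity identifier
        links[c] = entity_id
        for cand in candidates(c, non_core):
            if match_probability(features(c, cand)) > threshold:
                links[cand] = entity_id
    return links


records = [
    Record("A", "Blue Bottle Coffee", "Oakland"),
    Record("A", "blue bottle coffee", "oakland"),  # duplicate within source A
    Record("B", "Blue Bottle Coffee Co", "Oakland"),
    Record("B", "Red Door Tavern", "Oakland"),
    Record("B", "Blue Bottle Coffee", "Tokyo"),
]
links = resolve(records, core_source="A")
```

In this sketch the candidate condition acts as a cheap filter, so the pairwise feature and probability computation runs only on plausible pairs rather than on every cross-source combination.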

Status:

Grant

Type:

Utility

Filing date:

27 Jul 2018

Issue date:

13 Oct 2020