International Business Machines Corporation
Data processing

Abstract:

A method and associated system. Multiple virtual triples for an entity of multiple entities identified within a first data source are generated. Each virtual triple consists of a subject, a predicate, and an object. The subject is the entity. The predicate is a relationship between the entity and other entities identified within the first data source. The object is associated with an attribute of the entity. The subject, the predicate, and the object are each identified within the first data source. A degree of similarity between two entities of the two or more entities is identified by comparing the respective frequency metrics of the two entities. The two entities within the data structure are associated in response to a determination that an identified degree of similarity between the two entities exceeds a first predetermined threshold.
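
The flow described in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: the abstract does not define the frequency metric, the similarity measure, or the data structure, so the sketch assumes that an entity's frequency metric is a count of its (predicate, object) pairs and that similarity is the cosine between two such count vectors. The record fields (`entity`, `relation`, `value`), the function names, and the threshold value are all hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt

# Hypothetical records standing in for the "first data source".
RECORDS = [
    {"entity": "alice", "relation": "works_at", "value": "acme"},
    {"entity": "alice", "relation": "lives_in", "value": "paris"},
    {"entity": "bob", "relation": "works_at", "value": "acme"},
    {"entity": "bob", "relation": "lives_in", "value": "paris"},
    {"entity": "carol", "relation": "works_at", "value": "globex"},
    {"entity": "carol", "relation": "lives_in", "value": "paris"},
]


def build_triples(records):
    """Return one (subject, predicate, object) tuple per record."""
    return [(r["entity"], r["relation"], r["value"]) for r in records]


def frequency_metrics(triples):
    """Map each subject to a Counter of its (predicate, object) pairs.

    Assumption: this is only one plausible reading of "frequency metric".
    """
    metrics = {}
    for subject, predicate, obj in triples:
        metrics.setdefault(subject, Counter())[(predicate, obj)] += 1
    return metrics


def cosine_similarity(a, b):
    """Cosine similarity between two Counters (assumed similarity measure)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = sqrt(sum(v * v for v in a.values()))
    norm_b = sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def associate(metrics, threshold):
    """Return entity pairs whose similarity exceeds the threshold."""
    links = []
    for x, y in combinations(sorted(metrics), 2):
        if cosine_similarity(metrics[x], metrics[y]) > threshold:
            links.append((x, y))
    return links


triples = build_triples(RECORDS)
metrics = frequency_metrics(triples)
links = associate(metrics, threshold=0.8)  # threshold chosen for illustration
```

With this sample data, `alice` and `bob` share both of their (predicate, object) pairs and are associated, while `carol` shares only one pair with each of them and falls below the illustrative threshold.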

Status:
Grant
Type:

Utility

Filing date:

3 Jan 2020

Issue date:

30 Nov 2021