International Business Machines Corporation
Data model proposals
Last updated:
Abstract:
Relating data in various distributed data sources for use in data analysis is described. The data sources are generally related by first generating, for a plurality of data sources, a keyword model that includes a plurality of weighted keywords, and providing a visual representation of the keyword model, such as a word cloud, to a user. The user interacts with the visual representation to modify, update, and select various aspects of it. The user also identifies keywords and data sources of interest, and a plurality of relational models is generated based on that user interest. Relating the data sources also includes providing the plurality of relational models to the user, receiving a user selection of the plurality of relational models, and generating a combined dataset model which relates one or more of the data sources according to the selected relational models.
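The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed method: the weighting scheme (fraction of sources whose field names mention a keyword), the rule for proposing a relation (a shared field name containing a keyword of interest), and all names (`build_keyword_model`, `propose_relations`, `combine`, the sample `sources`) are assumptions made for illustration only. The word-cloud display and the user interaction are replaced by hard-coded choices.

```python
from collections import Counter
from itertools import combinations


def build_keyword_model(sources):
    """Assumed weighting: fraction of sources whose field names mention the keyword."""
    counts = Counter()
    for fields in sources.values():
        # A set per source, so each keyword is counted at most once per source.
        counts.update({word.lower() for field in fields for word in field.split("_")})
    return {word: n / len(sources) for word, n in counts.items()}


def propose_relations(sources, keywords_of_interest):
    """Assumed rule: propose a relation for each field shared by two sources
    when that field name contains a keyword the user marked as interesting."""
    proposals = []
    for (name_a, fields_a), (name_b, fields_b) in combinations(sorted(sources.items()), 2):
        for field in sorted(set(fields_a) & set(fields_b)):
            if any(k in field.lower().split("_") for k in keywords_of_interest):
                proposals.append((name_a, name_b, field))
    return proposals


def combine(proposals, selected_indices):
    """Combined dataset model: just the relations the user selected."""
    return [proposals[i] for i in selected_indices]


# Hypothetical data sources, each described only by its field names.
sources = {
    "orders": ["order_id", "customer_id", "amount"],
    "customers": ["customer_id", "region"],
    "shipments": ["order_id", "carrier"],
}

keyword_model = build_keyword_model(sources)                    # weights that would drive the word cloud
proposals = propose_relations(sources, {"customer", "order"})   # stands in for the user's stated interest
combined_model = combine(proposals, [0])                        # stands in for the user's selection
```

In this sketch each relational model is a plain `(source_a, source_b, shared_field)` tuple, which keeps the selection step trivial. A fuller treatment would likely score the proposals and carry join metadata, neither of which the abstract specifies.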
Utility
26 Nov 2019
26 Apr 2022