International Business Machines Corporation
Distinguishing web content and web content-associated data
Last updated:
Abstract:
A computer-implemented method can include obtaining content-associated data. The content-associated data can include an image that corresponds to content. The method can further include obtaining a set of descriptive terms corresponding to the image. The method can further include obtaining a set of related terms, based at least in part on the set of descriptive terms. The method can further include generating a set of user-expected terms, based at least in part on the set of related terms. The method can further include obtaining the content. The method can further include comparing the set of user-expected terms to the content. The method can further include generating a similarity value based on the comparing. The method can further include determining the similarity value does not exceed a threshold. The method can further include providing a notification in response to the determining that the similarity value does not exceed the threshold.
Utility
20 Aug 2019
28 Dec 2021