Microsoft Corporation
AUTOMATIC EMBEDDING OF ADDITIONAL CONTENT TO ARTICLES
Last updated:
Abstract:
The present disclosure relates to systems, devices, and methods for identifying additional content for an article. The systems, devices, and methods may identify a domain for the articles and content and may use machine learning models to classify the articles and the content into categories using smart tags for the domain. The systems, devices, and methods may convert the articles and the content into document vectors using a pre-trained domain specific language model and generate a relevance score for the articles and the content using the document vectors. The systems, devices, and methods may generate a list of predicted matches that includes content that is similar to the article based on the relevance score. The systems, devices, and methods may filter the list of predicted matches based on a temporal proximity to generate a list of additional content for the article.
Utility
12 Mar 2021
14 Jul 2022