Microsoft Corporation
Topic set refinement
Abstract:
A computing system including one or more processors generates a topic set for a domain. A taxonomic evaluator is executed by the one or more processors to evaluate a set of category clusters generated from domain-specific textual data against a domain-specific taxonomic tree based on a coherency condition and to identify the category clusters that satisfy the coherency condition. The domain-specific taxonomic tree is generated from hierarchical structures of documents relating to the domain. Each identified category cluster is assigned a label. A topic set creator is executed by the one or more processors to insert the labels of the identified category clusters into the topic set for the domain.
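The flow described in the abstract can be illustrated with a minimal Python sketch. The abstract does not define the coherency condition or how labels are chosen, so everything concrete here is an assumption made for illustration: the names `TaxonomyNode`, `evaluate_cluster`, and `build_topic_set` are hypothetical, the coherency condition is taken to be "at least a threshold fraction of the cluster's terms fall under a single non-root node of the taxonomic tree", and the label is taken to be the name of that node. The sample taxonomy and clusters are invented.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class TaxonomyNode:
    """A node in a hypothetical domain-specific taxonomic tree."""
    name: str
    children: list[TaxonomyNode] = field(default_factory=list)

    def terms(self) -> set[str]:
        """Return this node's name plus the names of all its descendants."""
        found = {self.name}
        for child in self.children:
            found |= child.terms()
        return found

    def walk(self):
        """Yield this node and every descendant, depth first."""
        yield self
        for child in self.children:
            yield from child.walk()


def evaluate_cluster(cluster: set[str], root: TaxonomyNode, threshold: float = 0.6):
    """Return a label if the cluster meets the ASSUMED coherency condition, else None.

    Assumed condition: some non-root node's subtree contains at least
    `threshold` of the cluster's terms. Ties on coverage are broken in
    favour of the smaller (more specific) subtree.
    """
    if not cluster:
        return None
    best = None  # ((coverage, -subtree_size), node_name)
    for node in root.walk():
        if node is root:
            continue  # the root covers everything and would be a trivial label
        subtree = node.terms()
        coverage = len(cluster & subtree) / len(cluster)
        if coverage >= threshold:
            key = (coverage, -len(subtree))
            if best is None or key > best[0]:
                best = (key, node.name)
    return best[1] if best else None


def build_topic_set(clusters, root: TaxonomyNode, threshold: float = 0.6) -> list[str]:
    """Insert the label of each coherent cluster into the topic set, without duplicates."""
    topic_set: list[str] = []
    for cluster in clusters:
        label = evaluate_cluster(set(cluster), root, threshold)
        if label is not None and label not in topic_set:
            topic_set.append(label)
    return topic_set


# Invented example data: a tiny taxonomy and three category clusters.
root = TaxonomyNode("cloud", [
    TaxonomyNode("storage", [TaxonomyNode("blob"), TaxonomyNode("disk"), TaxonomyNode("backup")]),
    TaxonomyNode("networking", [TaxonomyNode("vnet"), TaxonomyNode("dns"), TaxonomyNode("firewall")]),
])
clusters = [
    {"blob", "disk", "backup"},            # fully under "storage"
    {"vnet", "dns", "pizza"},              # two of three terms under "networking"
    {"blob", "dns", "pizza", "weather"},   # scattered, fails the assumed condition
]

print(build_topic_set(clusters, root))  # ['storage', 'networking']
```

In this sketch the third cluster is discarded because no single subtree covers enough of its terms. A different coherency condition (for example one based on tree distance between terms) would slot into `evaluate_cluster` without changing the surrounding flow.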
Utility
22 Jun 2018
26 Oct 2021