Groupon, Inc.
System, method, and computer program product for generation of local content corpus
Abstract:
Various methods are provided herein for generating a content corpus populated with content related to a particular geographic area. One example method comprises, for each document in an initial local content corpus: applying a first set of heuristic filters to the raw content of the document; identifying at least a second term; applying a second set of heuristic filters, associated with the second term, to the raw content of the document; iteratively identifying additional terms and applying an additional set of heuristic filters associated with those terms until each identifiable term is extracted; determining a level on a geographic containment hierarchy indicative of a location to which each document from the set of documents is local; and, for each place in a gazette and for each document, determining a set of points in polygons indicative of the document's locality.
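The steps in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the names `extract_terms`, `point_in_polygon`, and `locality_level`, the filter signature, and the gazette layout of `(name, level, polygon)` tuples are all hypothetical. The ray-casting containment test is a standard technique substituted here for illustration, and the rule "pick the most specific level whose polygon contains every point of the document" is an assumption about how a hierarchy level might be chosen.

```python
from typing import Callable, Dict, List, Optional, Sequence, Tuple

Point = Tuple[float, float]          # (x, y), e.g. (longitude, latitude)
Polygon = Sequence[Point]            # vertices in order, implicitly closed
Filter = Callable[[str], List[str]]  # hypothetical: raw text -> terms found


def extract_terms(raw: str,
                  first_filters: List[Filter],
                  filters_for_term: Dict[str, List[Filter]]) -> List[str]:
    """Apply filters, then the filters tied to each newly found term,
    repeating until no new term is identified."""
    terms: List[str] = []
    seen = set()
    pending = list(first_filters)
    while pending:
        next_round: List[Filter] = []
        for heuristic in pending:
            for term in heuristic(raw):
                if term not in seen:
                    seen.add(term)
                    terms.append(term)
                    # Queue the filter set associated with this term.
                    next_round.extend(filters_for_term.get(term, []))
        pending = next_round
    return terms


def point_in_polygon(point: Point, polygon: Polygon) -> bool:
    """Ray-casting test: count edge crossings of a ray going in +x."""
    x, y = point
    inside = False
    n = len(polygon)
    for i in range(n):
        x1, y1 = polygon[i]
        x2, y2 = polygon[(i + 1) % n]
        if (y1 > y) != (y2 > y):  # edge straddles the horizontal ray
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside


def locality_level(doc_points: Sequence[Point],
                   gazette: Sequence[Tuple[str, int, Polygon]]
                   ) -> Optional[Tuple[str, int]]:
    """Return (place, level) for the most specific place containing all
    of the document's points; lower level means more specific (assumed)."""
    best: Optional[Tuple[str, int]] = None
    for name, level, polygon in gazette:
        if doc_points and all(point_in_polygon(p, polygon) for p in doc_points):
            if best is None or level < best[1]:
                best = (name, level)
    return best


# Toy data, invented for illustration only.
first = [lambda raw: ["chicago"] if "Chicago" in raw else []]
by_term = {"chicago": [lambda raw: ["wicker park"] if "Wicker Park" in raw else []]}
gazette = [
    ("Downtown", 0, [(0, 0), (1, 0), (1, 1), (0, 1)]),
    ("Metro", 1, [(0, 0), (10, 0), (10, 10), (0, 10)]),
]
found_terms = extract_terms("Brunch spots in Wicker Park, Chicago", first, by_term)
local_to = locality_level([(0.5, 0.5), (5, 5)], gazette)
```

In this toy run the city-level filter surfaces a term whose associated filter then finds a neighborhood term, and a document whose points span both polygons is assigned to the broader place rather than the narrower one.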
Utility
3 Jan 2017
3 Nov 2020