First American Financial Corporation
System and method for automated selection of best description from descriptions extracted from a plurality of data sources using numeric comparison and textual centrality measure

Last updated:

Abstract:

Techniques are described for collecting descriptions of an entity from different data sources and using a numeric comparison and textual centrality measure to automatically select a best description. In one implementation, a method includes: retrieving a real property description dataset, the real property description dataset including descriptions from multiple data sources that describe the real property; extracting, from each of the descriptions, numbers that identify the property; performing a numerical comparison of the numbers extracted from each of the descriptions to determine if any descriptions needs to be discarded from further consideration; applying a text cleaning process to normalize the descriptions; and performing a textual centrality measure of remaining descriptions to determine a level agreement of each of the remaining descriptions with each of the other remaining descriptions; and using at least the textual centrality measure to select a description. The selected description may be used to populate a document.

Status:
Grant
Type:

Utility

Filling date:

19 Dec 2018

Issue date:

4 May 2021