Bank of America Corporation
Validating mappings between documents using machine learning
Last updated:
Abstract:
A device that includes an enterprise data indexing engine (EDIE) configured to determine a first set of similarity scores between a first set of sentences from a first document and a plurality of classification descriptions. The EDIE is further configured to identify one or more classification descriptions that have a similarity score that exceeds a predetermined threshold value. The EDIE is further configured to determine a second set of similarity scores between a second set of sentences from a second document and the plurality of classification descriptions. The EDIE is further configured to identify one or more classification descriptions that have a similarity score that exceeds the predetermined threshold value. The EDIE is further configured to populate a data structure that identifies the tokens within the first set of tokens and the second set of tokens and the number of times each token appears.
Utility
30 Aug 2019
10 May 2022