LendingClub Corporation
EXTRACTING VALUES FROM IMAGES OF DOCUMENTS
Last updated:
Abstract:
Techniques are described for extracting key values from a document without having to rely on finding corresponding labels for the target keys within the extracted text of the document. Further the techniques do not rely on knowledge of the correlation between (a) the location of labels within a document, and (b) the location of the key values that correspond to the labels. Key values are extracted from a document by, identifying candidate values within the document, establishing "joint-candidate" sets from those candidate values, and using a trained machine learning mechanism to score each joint-candidate set of values. The highest scoring joint-candidate set is deemed to reflect the correct mapping of candidate values to target keys for the document.
Utility
27 Dec 2019
1 Jul 2021