Alibaba Group Holding Limited
AUTOMATIC OPTICAL CHARACTER RECOGNITION (OCR) CORRECTION
Last updated:
Abstract:
Disclosed herein are computer-implemented methods, computer-implemented systems, and non-transitory, computer-readable media for automatic Optical Character Recognition (OCR) correction. One computer-implemented method includes evaluating an OCR result using a trained long short-term memory (LSTM) neural network language model to determine whether the OCR result requires correction. If correction is required, the text most similar to the OCR result is determined from a name and address corpus using a modified edit distance technique. The OCR result is then corrected with that most similar text.
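The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how the edit distance is modified, so the sketch assumes a weighted Levenshtein distance in which substitutions between visually confusable characters cost less. The trained LSTM language model is replaced by a caller-supplied predicate. The names CONFUSABLE, sub_cost, weighted_edit_distance, correct_ocr, and needs_correction are all hypothetical.

```python
from typing import Callable, Iterable

# Assumption: pairs of characters that OCR engines commonly confuse.
# This table is illustrative only and does not come from the source.
CONFUSABLE = {
    frozenset(pair)
    for pair in [("0", "O"), ("1", "l"), ("1", "I"), ("5", "S"), ("8", "B")]
}


def sub_cost(a: str, b: str) -> float:
    """Substitution cost: free if equal, cheaper for confusable pairs."""
    if a == b:
        return 0.0
    return 0.5 if frozenset((a, b)) in CONFUSABLE else 1.0


def weighted_edit_distance(s: str, t: str) -> float:
    """Levenshtein distance with a weighted substitution cost (an assumed
    'modification'); insertions and deletions each cost 1."""
    prev = [float(j) for j in range(len(t) + 1)]
    for i, a in enumerate(s, start=1):
        curr = [float(i)]
        for j, b in enumerate(t, start=1):
            curr.append(
                min(
                    prev[j] + 1.0,               # delete a from s
                    curr[j - 1] + 1.0,           # insert b into s
                    prev[j - 1] + sub_cost(a, b),  # substitute a with b
                )
            )
        prev = curr
    return prev[-1]


def correct_ocr(
    text: str,
    corpus: Iterable[str],
    needs_correction: Callable[[str], bool],
) -> str:
    """Return text unchanged unless needs_correction flags it; otherwise
    return the corpus entry with the smallest weighted edit distance.

    needs_correction stands in for the trained LSTM language model check.
    """
    if not needs_correction(text):
        return text
    return min(corpus, key=lambda entry: weighted_edit_distance(text, entry))


if __name__ == "__main__":
    corpus = ["LONDON", "LINDEN"]
    # Stand-in check: flag any string that is not already a corpus entry.
    print(correct_ocr("L0ND0N", corpus, lambda s: s not in corpus))
```

In this sketch the gating step and the lookup step are kept separate, so a real language model score with a threshold could be passed in as needs_correction without changing the distance code.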
Status:
Application
Type:
Utility
Filing date:
14 Feb 2020
Issue date:
3 Dec 2020