Alibaba Group Holding Limited
AUTOMATIC OPTICAL CHARACTER RECOGNITION (OCR) CORRECTION

Last updated:

Abstract:

Disclosed herein are computer-implemented methods, computer-implemented systems, and non-transitory, computer-readable media for automatic Optical Character Recognition (OCR) correction. One computer-implemented method includes evaluating an OCR result using a trained long short-term memory (LSTM) neural network language model to determine whether correction to the OCR result is required. If correction is required, the text most similar to the OCR result is determined from a name and address corpus using a modified edit distance technique. The OCR result is then corrected with the determined most similar text.
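
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how the edit distance is modified, so the sketch assumes one plausible variant in which substituting visually confusable characters costs less than other substitutions. The names `CONFUSABLE`, `substitution_cost`, `weighted_edit_distance`, and `correct_ocr` are hypothetical, and the `needs_correction` callback is only a placeholder for the trained LSTM language model check.

```python
from typing import Callable, Iterable

# Hypothetical pairs of visually confusable characters (an assumption;
# the abstract does not list any such pairs).
CONFUSABLE = {
    frozenset(pair)
    for pair in [("0", "O"), ("1", "l"), ("1", "I"), ("5", "S"), ("8", "B")]
}


def substitution_cost(a: str, b: str) -> float:
    """Cost of replacing character a with b (assumed weighting)."""
    if a == b:
        return 0.0
    if frozenset((a, b)) in CONFUSABLE:
        return 0.5  # cheaper: a likely OCR confusion
    return 1.0


def weighted_edit_distance(s: str, t: str) -> float:
    """Levenshtein distance with the weighted substitution cost above."""
    # prev[j] holds the distance between the processed prefix of s and t[:j].
    prev = [float(j) for j in range(len(t) + 1)]
    for i, a in enumerate(s, start=1):
        curr = [float(i)]
        for j, b in enumerate(t, start=1):
            curr.append(min(
                prev[j] + 1.0,                          # delete a
                curr[j - 1] + 1.0,                      # insert b
                prev[j - 1] + substitution_cost(a, b),  # substitute or match
            ))
        prev = curr
    return prev[-1]


def correct_ocr(
    text: str,
    corpus: Iterable[str],
    needs_correction: Callable[[str], bool],
) -> str:
    """Return text unchanged, or the closest corpus entry if flagged."""
    if not needs_correction(text):
        return text
    return min(corpus, key=lambda entry: weighted_edit_distance(text, entry))
```

For example, `correct_ocr("1O Main St", ["10 Main St", "12 Main St"], lambda t: True)` selects `"10 Main St"`, because the `O`/`0` substitution is weighted as a likely OCR confusion. In a real system the lambda would be replaced by a score threshold on the language model's output.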

Status:

Application

Type:

Utility

Filing date:

14 Feb 2020

Issue date:

3 Dec 2020