one
Systems and methods for text localization and recognition in an image of a document

Last updated:

Abstract:

Disclosed are methods, systems, and non-transitory computer-readable medium for localization and recognition of text from images. For instance, a first method may include: receiving an image; processing the image through a convolutional backbone to obtain feature maps(s); processing the feature maps through a region of interest (RoI) network to obtain RoIs; filtering the RoIs through a filtering block to obtain final RoIs; and processing the final RoIs through a text recognition stack to obtain predicted character sequences for the final RoIs. A second method may include: constructing a text localization and recognition neural network (TLaRNN); obtaining training data; training the TLaRNN on the training data; and storing trained weights of the TLaRNN. The constructing the TLaRNN may include: connecting a convolutional backbone to a region of interest (RoI) network; connecting the RoI network to a filtering block; and connecting the filtering block to a text recognition network.

Status:
Grant
Type:

Utility

Filling date:

24 Apr 2020

Issue date:

1 Jun 2021