International Business Machines Corporation
Optical character recognition (OCR) induction for multi-page changes

Last updated:

Abstract:

Provided are techniques for OCR induction for multi-page changes. A plurality of documents of a document type are processed to generate text area data for a text area in one or more documents of the plurality of documents, where the text area data includes coordinate locations of a zone for the text area based on expansion and direction of shift of the text area. A page flow model is trained using the plurality of documents and the text area data. In response to receiving a new document comprising the text area, a scanning script is received from the page flow model, where the page flow model identifies a new zone for the text area in the new document and determines how to adjust another zone for an element in the new document. The scanning script is used to scan the new document to generate digital text.

Status:
Grant
Type:

Utility

Filling date:

6 Jul 2020

Issue date:

17 May 2022