International Business Machines Corporation
TEXT BLOCK RECOGNITION BASED ON DISCRETE CHARACTER RECOGNITION AND TEXT INFORMATION CONNECTIVITY
Last updated:
Abstract:
In an approach for a text block recognition in a document, a processor detects characters in the document using an object detection technique. A processor identifies positions of the detected characters in the document. A processor analyzes semantic connectivity among the detected characters based on the positions and semantic connectivity of the characters. A processor recognizes text blocks of related characters based on the semantic connectivity analysis. A processor outputs the text blocks associated with the related characters.
Status:
Application
Type:
Utility
Filling date:
30 Jul 2020
Issue date:
3 Feb 2022