International Business Machines Corporation
Scalable structure learning via context-free recursive document decomposition
Last updated:
Abstract:
An approach is provided that aggregates a set of pixel values from a bitmap image into a set of row sum values and a set of column sum values. The bitmap image is a pixelated representation of a document. The approach applies a localized Fourier transform to the set of row sum values and the set of column sum values to generate frequency representations of the set of row sum values and the set of column sum values. The approach decomposes the bitmap image into a set of image portions based on at least one separation location identified in the set of frequency representations, and sends the set of image portions to a text recognition system.
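The steps in the abstract can be illustrated with a minimal sketch. It is not the patented implementation: it assumes ink pixels are 1 and background pixels are 0, it stands in for the "localized Fourier transform" with a Hann-windowed short-time FFT, and it uses a hypothetical criterion (the window with the lowest spectral energy) to pick a separation location. The function names, the `window` and `hop` parameters, and the selection rule are all illustrative assumptions.

```python
import numpy as np

def projection_profiles(bitmap):
    """Aggregate pixel values into row sums and column sums."""
    bitmap = np.asarray(bitmap, dtype=float)
    return bitmap.sum(axis=1), bitmap.sum(axis=0)

def windowed_energy(profile, window=8, hop=4):
    """Short-time FFT of a 1-D profile (an assumed form of 'localized' transform).

    Returns the start index of each window and the total spectral
    energy (sum of squared rFFT magnitudes) inside that window.
    """
    profile = np.asarray(profile, dtype=float)
    if len(profile) < window:
        raise ValueError("profile is shorter than the analysis window")
    starts = np.arange(0, len(profile) - window + 1, hop)
    taper = np.hanning(window)
    energy = np.array([
        np.sum(np.abs(np.fft.rfft(profile[s:s + window] * taper)) ** 2)
        for s in starts
    ])
    return starts, energy

def find_separation(profile, window=8, hop=4):
    """Hypothetical rule: separate at the centre of the lowest-energy window."""
    starts, energy = windowed_energy(profile, window, hop)
    return int(starts[np.argmin(energy)] + window // 2)

def split_rows(bitmap, row):
    """Decompose the bitmap into two image portions at a row index."""
    bitmap = np.asarray(bitmap)
    return bitmap[:row], bitmap[row:]
```

Applying the same functions to the column sums would yield vertical cuts, and repeating the split on each resulting portion gives the recursive decomposition named in the title. Handing the portions to a text recognition system is left out of the sketch.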
Status:
Grant
Type:
Utility
Filing date:
16 Sep 2019
Issue date:
30 Nov 2021