Intuit Inc.
Label and field identification without optical character recognition (OCR)
Last updated:
Abstract:
Systems of the present disclosure allow fields and labels to be identified in a digital image of a form without performing OCR. A digital image of a form can be partitioned into image segments using computer-vision image-segmentation techniques. Features for each image segment can be extracted using computer-vision feature-detection methods. The features extracted from an image segment can be included in an input instance for a machine-learning model. The machine-learning model can assign a classification to the input instance. The classification can associate the input instance with a field type or a label type.
Status:
Grant
Type:
Utility
Filling date:
24 Apr 2018
Issue date:
14 Apr 2020