Wipro Limited
METHOD AND SYSTEM FOR IDENTIFYING TYPE OF A DOCUMENT
Last updated:
Abstract:
Disclosed herein is a method and system for identifying type of an input document in real-time. In an embodiment, visual features and keywords of the input document are compared with reference visual features and reference keywords extracted from plurality of predetermined document types for computing a relative similarity score for the input document. Subsequently, one or more best-match document types are identified among the plurality of predetermined document types based on the relative similarity score of the input document. Thereafter, visual features and keywords of the input document are compared with global and local characteristics of the best-match document types for identifying the type of the input document. In an embodiment, the present disclosure helps in recognizing type of a document prior to digitizing the document, and thereby helps in storing the digitized documents in correct formats and appropriate storage directories.
Utility
28 Mar 2018
3 Oct 2019