Wipro Limited
METHOD, DEVICE, AND SYSTEM FOR CLUSTERING DOCUMENT OBJECTS BASED ON INFORMATION CONTENT
Abstract:
This disclosure relates to a method, device, and system for clustering document objects based on information content. The method may include identifying a plurality of object chunks from at least one document based on semantic context of each of the plurality of object chunks, determining at least one document portion from the at least one document as a base document based on a plurality of parameters applied to the plurality of object chunks, determining a plurality of hierarchies within the base document, and categorizing the plurality of object chunks based on the plurality of hierarchies and information in each of the plurality of object chunks. It should be noted that each of the plurality of object chunks may include at least one object selected from the at least one document.
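The four steps named in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the abstract does not say how semantic context, the selection parameters, or the hierarchies are computed, so every concrete choice here is an assumption made for illustration. Chunks are taken to be blank-line-separated paragraphs, the base document is the one with the most chunks (ties broken by vocabulary size), each hierarchy is a base-document chunk keyed by its first line, and categorization uses Jaccard word overlap. All function names are hypothetical.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercased word set; a crude stand-in for 'information content'."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def identify_chunks(documents):
    """Step 1 (assumed): one object chunk per blank-line-separated paragraph."""
    chunks = []
    for doc_id, text in documents.items():
        for part in re.split(r"\n\s*\n", text.strip()):
            if part.strip():
                chunks.append({"doc": doc_id, "text": part.strip()})
    return chunks


def choose_base_document(chunks):
    """Step 2 (assumed parameters): most chunks, then most distinct words."""
    stats = defaultdict(lambda: [0, 0])
    for chunk in chunks:
        stats[chunk["doc"]][0] += 1
        stats[chunk["doc"]][1] += len(tokenize(chunk["text"]))
    return max(stats, key=lambda doc_id: tuple(stats[doc_id]))


def extract_hierarchies(chunks, base_doc):
    """Step 3 (assumed): each base-document chunk is a node named by its first line."""
    return {
        chunk["text"].splitlines()[0]: tokenize(chunk["text"])
        for chunk in chunks
        if chunk["doc"] == base_doc
    }


def categorize(chunks, hierarchies):
    """Step 4 (assumed): assign each chunk to the node with the highest Jaccard overlap."""
    clusters = defaultdict(list)
    for chunk in chunks:
        words = tokenize(chunk["text"])

        def jaccard(name):
            union = words | hierarchies[name]
            return len(words & hierarchies[name]) / len(union) if union else 0.0

        clusters[max(hierarchies, key=jaccard)].append(chunk)
    return dict(clusters)
```

With a two-section manual and a one-line note about cleaning a filter, `choose_base_document` would pick the manual, and `categorize` would place the note under the manual's maintenance section. A practical system would likely replace the word-overlap measure with a semantic similarity model.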
Utility
29 Jan 2019
4 Jun 2020