Adobe Inc.
Semantic page segmentation of vector graphics documents

Last updated:

Abstract:

Disclosed systems and methods categorize text regions of an electronic document into document object types based on a combination of semantic information and appearance information from the electronic document. A page segmentation application executing on a computing device provides a textual feature representation and a visual feature representation to a neural network. The application identifies a correspondence between a location of the set of pixels in the electronic document and a location of a particular document object type in an output page segmentation. The application further outputs a classification of the set of pixels as being the particular document object type based on the identified correspondence.

Status:
Grant
Type:

Utility

Filling date:

30 Jan 2020

Issue date:

26 Apr 2022