eBay Inc.
IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT
Last updated:
Abstract:
In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking the plurality of sections as a content section or a non-content section using a visual attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking of the plurality of sections of the electronic document in a machine-readable medium.
Status:
Application
Type:
Utility
Filling date:
2 Oct 2019
Issue date:
2 Apr 2020