eBay Inc.
Identification of content in an electronic document
Last updated:
Abstract:
In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking the plurality of sections as a content section or a non-content section using a visual attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking of the plurality of sections of the electronic document in a machine-readable medium.
Status:
Grant
Type:
Utility
Filling date:
2 Oct 2019
Issue date:
2 Nov 2021