Adobe Inc.
Identifying artifacts in digital documents

Last updated:

Abstract:

Techniques described herein implement identifying artifacts in digital documents in a digital medium environment. A document analysis system is leveraged to extract page features from a digital document and to determine whether certain page features represent page artifacts such as headers and footers. Those page features determined to be page artifacts can be extracted from the digital document to generate a reflowed version of the digital document that preserves primary content. The primary content, for instance, is rearranged in the reflowed document to compensate for the extracted page artifacts.

Status:
Grant
Type:

Utility

Filling date:

25 Oct 2019

Issue date:

16 Mar 2021