Adobe Inc.
Constructing content based on multi-sentence compression of source content

Last updated:

Abstract:

Embodiments of the present invention provide systems, methods, and computer storage media directed to facilitating corpus-based content generation, in particular, using graph-based multi-sentence compression to generate a final content output. In one embodiment, pre-existing source content is identified and retrieved from a corpus. The source content is then parsed into sentence tokens, mapped and weighted. The sentence tokens are further parsed into word tokens and weighted. The mapped word tokens are then compressed into candidate sentences to be used in a final content. The final content is assembled using ranked candidate sentences, such that the final content is organized to reduce information redundancy and optimize content cohesion.

Status:
Grant
Type:

Utility

Filling date:

26 Dec 2017

Issue date:

16 Mar 2021