Oracle Corporation
Data loss prevention system for cloud security based on document discourse analysis

Last updated:

Abstract:

Systems, devices, and methods of the present invention relate to determining a document classification. For example, a document classification application generates a set of discourse trees, each discourse tree corresponding to a sentence of a document and including a rhetorical relationship that relates two elementary discourse units. The document classification application creates one or more communicative discourse trees from the discourse trees by matching each elementary discourse unit that has a verb to a verb signature. The document classification application then combines a first communicative discourse tree and a second communicative discourse tree into a parse thicket and applies a classification model to the parse thicket to determine whether the document is public or private.
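
The pipeline described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the patented implementation: every name here (EDU, DiscourseTree, ParseThicket, VERB_SIGNATURES, to_communicative, classify_document) is hypothetical, the verb lexicon is a toy stand-in, and the classification model is left as a caller-supplied function because the abstract does not specify it.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List, Optional, Tuple

# Hypothetical toy lexicon mapping a verb to its signature (thematic roles).
# A real system would draw on a much larger verb resource.
VERB_SIGNATURES: Dict[str, Tuple[str, ...]] = {
    "disclose": ("agent", "theme", "recipient"),
    "announce": ("agent", "theme"),
}


@dataclass
class EDU:
    """Elementary discourse unit: a text fragment with an optional main verb."""
    text: str
    verb: Optional[str] = None
    signature: Optional[Tuple[str, ...]] = None


@dataclass
class DiscourseTree:
    """One sentence: a rhetorical relation linking two EDUs."""
    relation: str
    nucleus: EDU
    satellite: EDU


@dataclass
class ParseThicket:
    """Container that combines the communicative discourse trees of a document."""
    trees: List[DiscourseTree] = field(default_factory=list)


def to_communicative(tree: DiscourseTree) -> DiscourseTree:
    """Return a copy of the tree whose verb-bearing EDUs carry a verb signature."""
    def annotate(edu: EDU) -> EDU:
        if edu.verb is None:
            return edu  # EDUs without a verb are left unchanged
        return EDU(edu.text, edu.verb, VERB_SIGNATURES.get(edu.verb))

    return DiscourseTree(tree.relation, annotate(tree.nucleus), annotate(tree.satellite))


def classify_document(
    trees: List[DiscourseTree],
    model: Callable[[ParseThicket], str],
) -> str:
    """Build a parse thicket from the trees and apply the supplied model to it."""
    thicket = ParseThicket([to_communicative(t) for t in trees])
    return model(thicket)
```

In this sketch the model is any function that maps a ParseThicket to a label such as "public" or "private"; how such a model would be trained or what features it would use is outside what the abstract states.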

Status:

Grant

Type:

Utility

Filing date:

15 Jun 2018

Issue date:

24 Aug 2021