SAP SE
Method and system for automated text anonymization

Last updated:

Abstract:

A method of producing an anonymized vector for a text mining task in lieu of a feature vector is disclosed. A vocabulary is created from a corpus of documents, each of the corpus of documents having a context that is similar to a set of target documents. The set of target documents is received. The feature vector is generated from a first document of the set of target documents. The feature vector is transformed into a composition vector. A synthetic vector is constructed based on the composition vector. The synthetic vector is shared as the anonymized vector in lieu of the feature vector.

Status:
Grant
Type:

Utility

Filling date:

29 Jan 2018

Issue date:

4 May 2021