SAP SE
Method and system for automated text anonymization
Last updated:
Abstract:
A method of producing an anonymized vector for a text mining task in lieu of a feature vector is disclosed. A vocabulary is created from a corpus of documents, each of the corpus of documents having a context that is similar to a set of target documents. The set of target documents is received. The feature vector is generated from a first document of the set of target documents. The feature vector is transformed into a composition vector. A synthetic vector is constructed based on the composition vector. The synthetic vector is shared as the anonymized vector in lieu of the feature vector.
Status:
Grant
Type:
Utility
Filling date:
29 Jan 2018
Issue date:
4 May 2021