International Business Machines Corporation
Privacy Protection Through Template Embedding

Last updated:

Abstract:

A mechanism is provided to implement a personally identifiable information (PII) detection mechanism that facilitates privacy protection utilizing template embedding learned from text sequences. Input text is processed using natural language processing to identify one or more pieces of personally identifiable information. A character analysis is performed of each character of each piece of personally identifiable information of the one or more pieces of personally identifiable information to identify a character type of character in the piece of personally identifiable information. For each piece of personally identifiable information and based on the associated identified character type, the identified character type is mapped to an associated template character in a set of template characters in a template character data structure. Utilizing the character-to-template mappings for the one or more pieces of personally identifiable information, an output text is generated that projects the template characters by direct character-level mapping.

Status:
Application
Type:

Utility

Filling date:

22 Jan 2020

Issue date:

22 Jul 2021