Amazon.com, Inc.
Transliteration of data records for improved data matching

Abstract:

A data records service is configured to receive original data records and, in parallel, generate transliterated versions of those records in a phonetic-based language. Individual fields of a data record can be transliterated by identifying a primary language, generating language-specific tokens for individual text portions, and transliterating the tokens. The service can then execute matching models on both the original and the transliterated data records to detect matching data records.
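
The pipeline described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. Every function name, the tiny Cyrillic-to-Latin table, the use of Unicode script as a stand-in for primary-language detection, and exact equality as a stand-in for the matching models are assumptions made for illustration. The sketch also runs sequentially, whereas the abstract describes transliteration happening in parallel.

```python
import unicodedata

# Hypothetical, deliberately tiny Cyrillic-to-Latin table (an assumption for
# this sketch); a real system would cover many scripts and richer rules.
CYRILLIC_TO_LATIN = {
    "а": "a", "в": "v", "е": "e", "и": "i", "к": "k", "м": "m",
    "н": "n", "о": "o", "п": "p", "р": "r", "с": "s", "т": "t",
}


def detect_primary_script(text):
    """Return the most frequent Unicode script among the letters of text.

    Script is used here as a crude proxy for the primary language.
    """
    counts = {}
    for ch in text:
        if ch.isalpha():
            # Unicode names start with the script, e.g. "CYRILLIC SMALL LETTER A".
            script = unicodedata.name(ch, "UNKNOWN").split()[0]
            counts[script] = counts.get(script, 0) + 1
    return max(counts, key=counts.get) if counts else "UNKNOWN"


def tokenize(text, script):
    """Split text into lowercase tokens.

    Whitespace splitting is enough for the two scripts handled here; the
    script argument marks where a language-specific tokenizer would plug in.
    """
    return text.lower().split()


def transliterate_token(token, script):
    """Map a Cyrillic token to Latin letters; leave other tokens unchanged."""
    if script != "CYRILLIC":
        return token
    return "".join(CYRILLIC_TO_LATIN.get(ch, ch) for ch in token)


def transliterate_record(record):
    """Transliterate every field of a record (a dict of field name to text)."""
    result = {}
    for field, value in record.items():
        script = detect_primary_script(value)
        tokens = tokenize(value, script)
        result[field] = " ".join(transliterate_token(t, script) for t in tokens)
    return result


def normalize(record):
    """Lowercase each field and collapse whitespace, without transliterating."""
    return {field: " ".join(value.lower().split()) for field, value in record.items()}


def records_match(a, b):
    """Match on the original form or on the transliterated form.

    Exact equality stands in for the matching models named in the abstract.
    """
    return normalize(a) == normalize(b) or transliterate_record(a) == transliterate_record(b)


original = {"name": "Иван Петров", "city": "Москва"}
latin = {"name": "Ivan Petrov", "city": "Moskva"}
print(transliterate_record(original))
print(records_match(original, latin))
```

Comparing both the original and the transliterated forms means that records written in the same script still match directly, while records written in different scripts can match through their shared phonetic rendering.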

Status:

Grant

Type:

Utility

Filing date:

20 Nov 2018

Issue date:

14 Sep 2021