Intuit Inc.
UNSUPERVISED COMPETITION-BASED ENCODING

Last updated:

Abstract:

A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the first phrase vector of the first identifier to a second phrase vector of the second identifier. A tuple is generated that includes the first identifier and the second identifier using the similarity metric. A machine learning model is trained with the tuple to generate an embedded vector corresponding to the first identifier.

Status:
Application
Type:

Utility

Filling date:

28 Jul 2020

Issue date:

3 Feb 2022