Intuit Inc.
UNSUPERVISED COMPETITION-BASED ENCODING
Abstract:
A method collects word-based data corresponding to a first identifier. A first phrase vector is generated for the first identifier by extracting frequency data from the word-based data. A similarity metric is generated corresponding to the first identifier and a second identifier by comparing the first phrase vector of the first identifier to a second phrase vector of the second identifier. A tuple is generated that includes the first identifier and the second identifier using the similarity metric. A machine learning model is trained with the tuple to generate an embedded vector corresponding to the first identifier.
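The first four steps of the abstract (collect word-based data, build frequency-based phrase vectors, compare them, emit tuples) can be illustrated with a minimal sketch. It is not the claimed implementation: the cosine measure, the 0.5 threshold, the function names, and the sample vendor identifiers are all assumptions chosen for illustration, and the final step of training a machine learning model on the tuples is left out.

```python
import math
from collections import Counter
from itertools import combinations


def phrase_vector(texts):
    """Build a word-frequency vector (word -> count) from an identifier's text data."""
    counts = Counter()
    for text in texts:
        counts.update(text.lower().split())
    return counts


def cosine_similarity(a, b):
    """Cosine similarity between two sparse frequency vectors (assumed metric)."""
    dot = sum(a[w] * b[w] for w in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def build_tuples(word_data, threshold=0.5):
    """Pair identifiers whose phrase vectors are at least `threshold` similar.

    The threshold is a hypothetical selection rule; the abstract only says
    the tuple is generated "using the similarity metric".
    """
    vectors = {ident: phrase_vector(texts) for ident, texts in word_data.items()}
    tuples = []
    for first, second in combinations(sorted(vectors), 2):
        score = cosine_similarity(vectors[first], vectors[second])
        if score >= threshold:
            tuples.append((first, second, score))
    return tuples


# Hypothetical word-based data keyed by identifier.
word_data = {
    "vendor_a": ["office supplies", "paper and office supplies"],
    "vendor_b": ["office supplies store"],
    "vendor_c": ["car rental"],
}
pairs = build_tuples(word_data)
```

In this sketch `pairs` holds only the `vendor_a`/`vendor_b` tuple, since `vendor_c` shares no words with the others. Tuples of this kind would then serve as training examples for a model that produces an embedded vector per identifier.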
Status:
Application
Type:
Utility
Filing date:
28 Jul 2020
Issue date:
3 Feb 2022