Airbnb, Inc.
Feature transformation and missing values

Last updated:

Abstract:

A behavior detection module receives a training database and applies a transformation to the attributes that improves the uniformity of the values associated with each attribute. The transformed training database is used to construct a random forest classifier (RFC). The RFC includes a plurality of decision trees and generates a classification label estimate for a data entry with a plurality of attributes. The classification label estimate is determined based on classification estimates from the plurality of decision trees. Each parent node of a decision tree is associated with a condition of a transformed attribute that directs the data entry to a corresponding child node depending on whether the condition is satisfied or not. The data entry is directed through the tree to one out of a set of leaf nodes, and a classification label associated with the leaf node.

Status:
Grant
Type:

Utility

Filling date:

14 Dec 2015

Issue date:

14 Jan 2020