Synaptics Incorporated
RECURRENT MULTIMODAL ATTENTION SYSTEM BASED ON EXPERT GATED NETWORKS

Abstract:

Systems and methods for multimodal classification include: a plurality of expert modules, each configured to receive data corresponding to one of a plurality of input modalities and to extract associated features; a plurality of class prediction modules, each configured to receive the extracted features from a corresponding one of the expert modules and to predict an associated class; a gate expert configured to receive the extracted features from the plurality of expert modules and to output a set of weights for the input modalities; and a fusion module configured to generate a weighted prediction based on the class predictions and the set of weights. Various embodiments include one or more of an image expert, a video expert, an audio expert, class prediction modules, a gate expert, and a co-learning framework.
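
The fusion step described in the abstract can be illustrated with a minimal sketch. This is not the implementation claimed in the application: the function names, the example probabilities, and the gate scores are hypothetical, and normalizing the gate output with a softmax is an assumption (the abstract says only that the gate expert outputs a set of weights for the input modalities). Feature extraction by the expert modules is omitted; the sketch starts from per-modality class predictions.

```python
import math


def softmax(scores):
    """Normalize raw scores into weights that sum to 1 (assumed gate normalization)."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse_predictions(class_probs, gate_scores):
    """Combine per-modality class predictions using gate weights.

    class_probs: one list of class probabilities per modality.
    gate_scores: one raw gate score per modality (hypothetical gate output).
    Returns the modality weights and the fused class distribution.
    """
    if len(class_probs) != len(gate_scores):
        raise ValueError("need exactly one gate score per modality")
    weights = softmax(gate_scores)
    num_classes = len(class_probs[0])
    fused = [
        sum(w * probs[c] for w, probs in zip(weights, class_probs))
        for c in range(num_classes)
    ]
    return weights, fused


# Hypothetical class predictions from three modalities over three classes.
image_probs = [0.7, 0.2, 0.1]
audio_probs = [0.1, 0.8, 0.1]
video_probs = [0.6, 0.3, 0.1]

weights, fused = fuse_predictions(
    [image_probs, audio_probs, video_probs],
    gate_scores=[2.0, 0.0, 1.0],  # illustrative values only
)
predicted_class = max(range(len(fused)), key=fused.__getitem__)
```

In this sketch the image modality receives the largest weight, so the fused prediction follows the image and video predictions rather than the audio prediction.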

Status:

Application

Type:

Utility

Filing date:

20 May 2019

Issue date:

21 Nov 2019