International Business Machines Corporation
Context-aware action recognition by dual attention networks
Last updated:
Abstract:
Provided are embodiments including a computer-implemented method for performing recognition. The computer-implemented method includes receiving video data, and performing, at a pre-attention prediction module, a pre-attention prediction for the video data to generate first prediction priors. The computer-implemented method also includes receiving, at a dual attention module, data including the video data and data from the pre-attention prediction to generate attention maps, wherein the attention maps indicate a region of interest of a frame of the video data, wherein the dual attention module generates enhanced feature representations, and performing, at a post-attention prediction module, a post-attention prediction from data from the dual attention module based at least in part on the enhanced feature representation. Also provided are embodiments for a system and a computer program produce for performing recognition.
Utility
10 Jul 2020
7 Dec 2021