NVIDIA Corporation
ITERATIVE SPATIO-TEMPORAL ACTION DETECTION IN VIDEO
Last updated:
Abstract:
Iterative prediction systems and methods for the task of action detection process an inputted sequence of video frames to generate an output of both action tubes and respective action labels, wherein the action tubes comprise a sequence of bounding boxes on each video frame. An iterative predictor processes large offsets between the bounding boxes and the ground-truth.
Status:
Application
Type:
Utility
Filling date:
22 Apr 2021
Issue date:
5 Aug 2021