NVIDIA Corporation
ITERATIVE SPATIO-TEMPORAL ACTION DETECTION IN VIDEO

Abstract:

Iterative prediction systems and methods for action detection process an input sequence of video frames to generate both action tubes and their respective action labels, wherein each action tube comprises a sequence of bounding boxes across the video frames. An iterative predictor processes large offsets between the bounding boxes and the ground truth.
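
The idea of refining a bounding box over several passes can be illustrated with a minimal Python sketch. It is not taken from the application: the names `ActionTube`, `refine_box`, and `make_toy_predictor` are hypothetical, the one-box-per-frame layout is an assumption, and the toy predictor stands in for a learned model by stepping halfway toward a known target box, which a real system would not have access to at inference time.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

# Assumed box layout: (x1, y1, x2, y2) in pixel coordinates.
Box = Tuple[float, float, float, float]


@dataclass
class ActionTube:
    """Hypothetical container: one action label plus one box per frame."""
    label: str
    boxes: List[Box]


def refine_box(box: Box, predict_offset: Callable[[Box], Box], num_iters: int) -> Box:
    """Apply a predicted offset to the box repeatedly, num_iters times."""
    for _ in range(num_iters):
        offset = predict_offset(box)
        box = tuple(b + d for b, d in zip(box, offset))
    return box


def make_toy_predictor(target: Box, step: float = 0.5) -> Callable[[Box], Box]:
    """Stand-in for a learned model: returns a fraction of the remaining offset."""
    def predict(box: Box) -> Box:
        return tuple(step * (t - b) for t, b in zip(target, box))
    return predict


def refine_tube(tube: ActionTube, predictors: List[Callable[[Box], Box]],
                num_iters: int = 3) -> ActionTube:
    """Refine every per-frame box of a tube with its own offset predictor."""
    boxes = [refine_box(b, p, num_iters) for b, p in zip(tube.boxes, predictors)]
    return ActionTube(tube.label, boxes)


if __name__ == "__main__":
    start = (0.0, 0.0, 10.0, 10.0)
    target = (8.0, 8.0, 24.0, 24.0)
    for n in range(4):
        print(n, refine_box(start, make_toy_predictor(target), n))
```

In this sketch each pass halves the remaining offset, so an initial box that is far from the target closes most of the gap within a few iterations, whereas a single pass would leave half of the offset uncorrected.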

Status:

Application

Type:

Utility

Filing date:

22 Apr 2021

Issue date:

5 Aug 2021