Meta Platforms, Inc.
Identifying regions of interest in captured video data objects by detecting movement within higher resolution frames of the regions
Last updated:
Abstract:
Multiple users communicate over a network via client devices that include one or more cameras and a display to enable video messaging. At least one of the client devices modifies regions of video data captured by the client device's camera to more prominently identify the people within the video data. To identify a person, the client device disambiguates between actual people and static objects that may appear like people. The client device uses pose models to identify bounding boxes and applies a motion model to determine if a bounding box may include a person based on an amount of movement within the bounding box. If a threshold amount of movement is detected in a bounding box, the client device obtains a higher resolution portion of the scene including the bounding box and classifies whether the bounding box contains a person based on movement within the higher resolution video.
Utility
9 Sep 2019
22 Dec 2020