The Walt Disney Company
Joint estimation from images
Last updated:
Abstract:
Techniques are disclosed for estimating poses from images. In one embodiment, a machine learning model, referred to herein as the "detector," is trained to estimate animal poses from images in a bottom-up fashion. In particular, the detector may be trained using rendered images depicting animal body parts scattered over realistic backgrounds, as opposed to renderings of full animal bodies. In order to make appearances of the rendered body parts more realistic so that the detector can be trained to estimate poses from images of real animals, the body parts may be rendered using textures that are determined from a translation of rendered images of the animal into corresponding images with more realistic textures via adversarial learning. Three-dimensional poses may also be inferred from estimated joint locations using, e.g., inverse kinematics.
Utility
28 Feb 2019
9 Feb 2021