NVIDIA Corporation
3D PLANE DETECTION AND RECONSTRUCTION USING A MONOCULAR IMAGE

Last updated: 23 Jul 2021

Abstract:

Planar regions in three-dimensional scenes offer important geometric cues in a variety of three-dimensional perception tasks such as scene understanding, scene reconstruction, and robot navigation. Image analysis to detect planar regions can be performed by a deep learning architecture that includes a number of neural networks configured to estimate parameters for the planar regions. The neural networks process an image to detect an arbitrary number of plane objects in the image. Each plane object is associated with a number of estimated parameters including bounding box parameters, plane normal parameters, and a segmentation mask. Global parameters for the image, including a depth map, can also be estimated by one of the neural networks. Then, a segmentation refinement network jointly optimizes (i.e., refines) the segmentation masks for each instance of the plane objects and combines the refined segmentation masks to generate an aggregate segmentation mask for the image.
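
The per-plane parameters and the final combination step described above can be illustrated with a minimal sketch. It is not the method claimed in the application: the segmentation refinement network is a learned model, and here it is replaced by a plain per-pixel argmax purely to show what an aggregate segmentation mask looks like. The names `PlaneObject` and `aggregate_masks`, the bounding-box layout, and the `threshold` value are all assumptions made for illustration.

```python
from dataclasses import dataclass

import numpy as np


@dataclass
class PlaneObject:
    """Hypothetical container for the per-plane estimates named in the abstract."""
    bbox: tuple          # (x_min, y_min, x_max, y_max) in pixels; layout is assumed
    normal: np.ndarray   # plane normal, shape (3,)
    mask: np.ndarray     # soft segmentation mask with values in [0, 1], shape (H, W)


def aggregate_masks(planes, image_shape, threshold=0.5):
    """Combine per-plane soft masks into one (H, W) label map.

    Label 0 means "no plane"; label k (k >= 1) refers to planes[k - 1].
    This argmax rule is a stand-in for the learned refinement network.
    """
    if not planes:
        return np.zeros(image_shape, dtype=np.int64)
    stacked = np.stack([p.mask for p in planes])   # shape (N, H, W)
    best = stacked.argmax(axis=0)                  # index of strongest plane per pixel
    best_score = stacked.max(axis=0)               # its mask value
    # Pixels where no plane is confident enough stay unlabeled (0).
    return np.where(best_score >= threshold, best + 1, 0)


# Two made-up plane detections on a 2x2 image.
floor = PlaneObject((0, 0, 2, 2), np.array([0.0, 1.0, 0.0]),
                    np.array([[0.9, 0.2], [0.1, 0.3]]))
wall = PlaneObject((0, 0, 2, 2), np.array([0.0, 0.0, 1.0]),
                   np.array([[0.1, 0.7], [0.2, 0.6]]))
labels = aggregate_masks([floor, wall], image_shape=(2, 2))
print(labels)
```

In this toy input, the bottom-left pixel stays at label 0 because neither plane's mask reaches the assumed threshold there.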

Status:

Application

Type:

Utility

Filing date:

10 Sep 2019

Issue date:

28 May 2020

