NVIDIA Corporation
GUIDED UNCERTAINTY-AWARE POLICY OPTIMIZATION: COMBINING MODEL-FREE AND MODEL-BASED STRATEGIES FOR SAMPLE-EFFICIENT LEARNING

Last updated: 23 Jul 2021

Abstract:

A robot is controlled using a combination of model-based and model-free control methods. In some examples, the model-based method uses a physical model of the environment around the robot to guide the robot. The physical model is oriented using a perception system such as a camera. Characteristics of the perception system may be are used to determine an uncertainty for the model. Based at least in part on this uncertainty, the system transitions from the model-based method to a model-free method where, in some embodiments, information provided directly from the perception system is used to direct the robot without reliance on the physical model.

Status:

Application

Type:

Utility

Filling date:

3 Feb 2020

Issue date:

20 May 2021

Full patent description

Patent application document

NVIDIA Corporation GUIDED UNCERTAINTY-AWARE POLICY OPTIMIZATION: COMBINING MODEL-FREE AND MODEL-BASED STRATEGIES FOR SAMPLE-EFFICIENT LEARNING

Abstract:

NVIDIA Corporation
GUIDED UNCERTAINTY-AWARE POLICY OPTIMIZATION: COMBINING MODEL-FREE AND MODEL-BASED STRATEGIES FOR SAMPLE-EFFICIENT LEARNING