Royal Bank of Canada
DEVICES AND METHODS FOR REINFORCEMENT LEARNING VISUALIZATION USING IMMERSIVE ENVIRONMENTS

Last updated:

Abstract:

Disclosed are systems, methods, and devices for generating a visualization of a deep reinforcement learning (DRL) process. State data is received, reflective of states of an environment explored by an DRL agent, each state corresponding to a time step. For each given state, saliency metrics are calculated by processing the state data, each metric measuring saliency of a feature at the time step corresponding to the given state. A graphical visualization is generated, having at least two dimensions in which: each feature of the environment is graphically represented along a first axis; and each time step is represented along a second axis; and a plurality of graphical markers representing corresponding saliency metrics, each graphical marker having a size commensurate with the magnitude of the particular saliency metric represented, and a location along the first and second axes corresponding to the feature and time step for the particular saliency metric.

Status:
Application
Type:

Utility

Filling date:

31 Jul 2020

Issue date:

4 Feb 2021