SYSTEMS PROVIDING A LEARNING CONTROLLER UTILIZING INDEXED MEMORY AND METHODS THERETO

Abstract:

A system includes one or more memory devices storing instructions and one or more processors configured to execute the instructions to perform steps of a method. The method can include receiving observations and a corresponding class label; determining a candidate key based on the observations; determining a current memory state of a memory module based on a similarity of stored keys to the candidate key; generating a measurement vector based on the current memory state; concatenating the candidate key and the measurement vector to form a state vector; determining, based on the state vector and an action distribution policy, an action of a plurality of actions such that the determined action maximizes an expected reduction in entropy as compared to the remaining actions of the plurality of actions; executing the determined action; determining a value of the determined action; and updating, based on the value, the action distribution policy.
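
The sequence of steps in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the application: the names `IndexedMemory`, `Controller`, and `candidate_key`, the two actions `"store"` and `"skip"`, the use of cosine similarity, the linear estimate of expected entropy reduction, and the learning rate are all hypothetical. Exploration is omitted, so ties between actions resolve to the first action listed.

```python
import math


def softmax(z):
    m = max(z)
    e = [math.exp(v - m) for v in z]
    total = sum(e)
    return [v / total for v in e]


def entropy(p):
    # Shannon entropy in nats; zero-probability entries contribute nothing.
    return -sum(q * math.log(q) for q in p if q > 0)


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def candidate_key(observations):
    # Hypothetical stand-in for a learned encoder: L2-normalize the input.
    norm = math.sqrt(sum(x * x for x in observations)) or 1.0
    return [x / norm for x in observations]


class IndexedMemory:
    """Stores (key, class label) pairs; an assumed form of the memory module."""

    def __init__(self, num_classes):
        self.num_classes = num_classes
        self.keys, self.labels = [], []

    def measure(self, key, temperature=0.1):
        # Measurement vector: similarity-weighted class distribution.
        if not self.keys:
            return [1.0 / self.num_classes] * self.num_classes
        sims = [cosine(key, k) / temperature for k in self.keys]
        out = [0.0] * self.num_classes
        for w, label in zip(softmax(sims), self.labels):
            out[label] += w
        return out

    def write(self, key, label):
        self.keys.append(list(key))
        self.labels.append(label)


class Controller:
    """Assumed policy: a linear estimate of entropy reduction per action."""

    def __init__(self, state_dim, actions=("store", "skip"), lr=0.1):
        self.actions, self.lr = actions, lr
        self.weights = [[0.0] * state_dim for _ in actions]

    def step(self, memory, observations, label):
        key = candidate_key(observations)
        before = memory.measure(key)
        state = key + before  # concatenate key and measurement vector
        scores = [sum(w * s for w, s in zip(row, state)) for row in self.weights]
        a = max(range(len(scores)), key=lambda i: scores[i])  # greedy choice
        if self.actions[a] == "store":  # execute the chosen action
            memory.write(key, label)
        # Value of the action: the entropy reduction actually observed.
        value = entropy(before) - entropy(memory.measure(key))
        error = value - scores[a]
        for j, s in enumerate(state):  # delta-rule update of the policy
            self.weights[a][j] += self.lr * error * s
        return self.actions[a], value
```

Calling `Controller(state_dim=4).step(IndexedMemory(num_classes=2), [1.0, 0.0], 0)` runs one pass through the loop: the key is compared against stored keys, the chosen action is executed, and the policy weights move toward the observed entropy reduction. A practical controller would likely replace the greedy choice with sampling from an action distribution.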

Status:

Application

Type:

Utility

Filing date:

23 Nov 2020

Issue date:

20 May 2021