International Business Machines Corporation
OPTIMIZING A MACHINE FOR SOLVING OFFLINE OPTIMIZATION PROBLEMS
Abstract:
A method for improving a machine operation is provided. The method includes receiving a plurality of domain-specific heuristics, a set of states, and a set of actions, where an immediate cost and/or reward is associated with each state-action pair. The method also includes generating at least one of: a graph of state transitions for the actions, and a transition probability matrix. The method also includes executing a Markov Decision Process (MDP) model for solving an MDP problem, and outputting an optimal MDP policy that maps a given state to an action. The method also includes selecting one of the plurality of domain-specific heuristics and heuristic input parameters thereof. The method also includes controlling the machine for solving a predefined optimization problem in a plurality of execution iterations.
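The MDP step described in the abstract can be illustrated with a minimal sketch. It uses value iteration, which is one standard way to solve an MDP; the abstract does not say which solver the method uses, so this choice is an assumption. The function names, the discount factor `gamma`, and the two-state toy problem (a "stalled" and an "improving" search state, with "keep" and "switch" heuristic actions) are all hypothetical and chosen only for illustration. The toy transitions are deterministic; a real transition probability matrix would generally contain fractional entries.

```python
def q_values(P, R, V, gamma):
    """Return Q[s][a] = R[s][a] + gamma * sum_t P[a][s][t] * V[t]."""
    n_states, n_actions = len(R), len(R[0])
    return [
        [
            R[s][a] + gamma * sum(P[a][s][t] * V[t] for t in range(n_states))
            for a in range(n_actions)
        ]
        for s in range(n_states)
    ]


def value_iteration(P, R, gamma=0.9, tol=1e-8, max_iter=10_000):
    """Solve a finite MDP by value iteration (illustrative sketch only).

    P[a][s][t] is the probability of moving from state s to state t under
    action a; R[s][a] is the immediate reward (use negative values for costs).
    Returns the state values and a policy mapping each state to an action.
    """
    V = [0.0] * len(R)
    for _ in range(max_iter):
        V_new = [max(row) for row in q_values(P, R, V, gamma)]
        delta = max(abs(x - y) for x, y in zip(V_new, V))
        V = V_new
        if delta < tol:
            break
    # Greedy policy with respect to the final value estimate.
    Q = q_values(P, R, V, gamma)
    policy = [max(range(len(row)), key=row.__getitem__) for row in Q]
    return V, policy


# Hypothetical toy problem: state 0 = "stalled", state 1 = "improving";
# action 0 = "keep current heuristic", action 1 = "switch heuristic".
P = [
    [[1.0, 0.0], [0.0, 1.0]],  # keep: the state is unchanged
    [[0.0, 1.0], [1.0, 0.0]],  # switch: the state flips
]
R = [
    [0.0, -1.0],  # stalled: keeping earns nothing, switching costs 1
    [2.0, 0.0],   # improving: keeping earns 2, switching earns nothing
]

V, policy = value_iteration(P, R)
print(policy)  # state -> action mapping
```

In this toy setting the resulting policy switches heuristics when the search is stalled and keeps the current heuristic when it is improving, which is the kind of state-to-action mapping the abstract refers to as the MDP policy.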
Utility
25 Sep 2020
31 Mar 2022