Mastercard Incorporated
METHODS AND SYSTEMS FOR REDUCING BIAS IN AN ARTIFICIAL INTELLIGENCE MODEL
Last updated:
Abstract:
Embodiments provide methods and systems for reducing bias in an artificial intelligence model. A method includes computing, by a processor, a reward value based at least in part on a similarity between model predictions from a pre-trained model and agent predictions from a Reinforcement Learning (RL) agent. The method includes performing each step of one or more steps of a rule of a plurality of rules. The rule is assigned a weight and the rule includes a protected attribute, a cumulative statistic value type, and a comparison threshold. The method includes sending a cumulative reward value generated using the reward value and each weighted punishment value computed based at least in part on applying each rule of the plurality of rules to the RL agent. The RL agent learns to biases from the agent predictions while maintaining similarity with model predictions by maximizing the cumulative reward value.
Utility
21 Apr 2021
28 Oct 2021