Fair Isaac Corporation
OVERLY OPTIMISTIC DATA PATTERNS AND LEARNED ADVERSARIAL LATENT FEATURES

Abstract:

Systems, methods, and computer program products for improving the security of artificial intelligence systems are disclosed. The system comprises processors that monitor one or more transactions received by a machine learning decision model to determine a first score associated with a first transaction. The first transaction may be identified as likely adversarial in response to the first score being lower than a certain score threshold and the first transaction having a low occurrence likelihood. A second score may be generated for the first transaction based on one or more adversarial latent features associated with that transaction. At least one adversarial latent feature may be detected as being exploited by the first transaction in response to determining that the second score falls above the certain score threshold. Accordingly, an abnormal volume of activations of adversarial latent features across a plurality of scored transactions may be detected and blocked.
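
The decision flow in the abstract can be illustrated with a minimal Python sketch. It is not the claimed implementation: the threshold values, the function names, the per-feature activation cutoff, and the rate-based test for "abnormal volume" are all assumptions introduced here for illustration, and the abstract does not specify how the scores, the occurrence likelihood, or the latent features are computed.

```python
from collections import Counter
from typing import Dict, List

# All constants below are hypothetical; the abstract names no values.
SCORE_THRESHOLD = 0.5         # shared threshold for the first and second score
LIKELIHOOD_THRESHOLD = 0.01   # below this, a transaction pattern counts as rare
ACTIVATION_THRESHOLD = 0.8    # above this, a latent feature counts as activated


def is_likely_adversarial(first_score: float, likelihood: float) -> bool:
    """Flag a transaction whose first score is low and whose pattern is rare."""
    return first_score < SCORE_THRESHOLD and likelihood < LIKELIHOOD_THRESHOLD


def exploited_features(second_score: float,
                       activations: Dict[str, float]) -> List[str]:
    """Return the latent features treated as exploited by one transaction.

    Nothing is reported unless the second score is above the threshold.
    """
    if second_score <= SCORE_THRESHOLD:
        return []
    return sorted(name for name, value in activations.items()
                  if value > ACTIVATION_THRESHOLD)


def features_to_block(activation_counts: Counter,
                      n_scored: int,
                      max_rate: float) -> List[str]:
    """Return features whose activation rate across scored transactions
    exceeds max_rate (an assumed stand-in for 'abnormal volume')."""
    if n_scored == 0:
        return []
    return sorted(name for name, count in activation_counts.items()
                  if count / n_scored > max_rate)
```

In use, a caller would accumulate the output of `exploited_features` into a `Counter` over a window of scored transactions, then pass that counter to `features_to_block` to decide which latent features warrant blocking.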

Status:

Application

Type:

Utility

Filing date:

23 Nov 2020

Issue date:

26 May 2022