International Business Machines Corporation
IDENTIFYING ADVERSARIAL ATTACKS WITH ADVANCED SUBSET SCANNING

Last updated:

Abstract:

An illustrative embodiment includes a method for detecting whether a given item within input data is anomalous. The method includes: extracting activations at respective nodes of an autoencoder trained on the data, the activations comprising activations at the respective nodes for the given item within the data and for one or more other items within the data; calculating p-values corresponding to the respective nodes, wherein the p-value for a given node represents a proportion of the activations at the given node for the other items which are greater than the activations at the given node for the given item; determining at least one score at least in part by scanning for a subset of the respective nodes which maximizes a statistical scoring function applied to the corresponding p-values; and providing a visualization of at least the given item within the input data to a human user.

Status:
Application
Type:

Utility

Filling date:

30 Jan 2020

Issue date:

29 Jul 2021