Intel Corporation
UTILIZING DATA PROVENANCE TO DEFEND AGAINST DATA POISONING ATTACKS
Last updated:
Abstract:
The present invention discloses a secure ML pipeline to improve the robustness of ML models against poisoning attacks and utilizing data provenance as a tool. Two components are added to the ML pipeline, a data quality pre-processor, which filters out untrusted training data based on provenance derived features and an audit post-processor, which localizes the malicious source based on training dataset analysis using data provenance.
Status:
Application
Type:
Utility
Filling date:
10 Aug 2021
Issue date:
2 Dec 2021