by Nicholas Carlini 2018-07-15 [last updated 2019-11-26]
From time to time I receive emails asking how to get started studying adversarial machine learning. Below is the list of papers I recommend reading to become familiar with the specific sub-field of evasion attacks on machine learning systems (i.e., adversarial examples).
Alternatively, you may be interested in seeing an (unfiltered) list of all 1000+ adversarial example papers.
There are three versions of this list:
Preliminary Papers
Evasion Attacks against Machine Learning at Test TimeAttacks (1)
Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial SamplesDefenses
Towards Deep Learning Models Resistant to Adversarial AttacksAttacks (2)
ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute ModelsPreliminary Papers
Evasion Attacks against Machine Learning at Test TimeAttacks [requires Preliminary Papers]
The Limitations of Deep Learning in Adversarial SettingsTransferability [requires Preliminary Papers]
Transferability in Machine Learning: from Phenomena to Black-Box Attacks using Adversarial SamplesDetecting Adversarial Examples [requires Attacks, Transferability]
On Detecting Adversarial PerturbationsRestricted Threat Model Attacks [requires Attacks]
ZOO: Zeroth Order Optimization based Black-box Attacks to Deep Neural Networks without Training Substitute ModelsPhysical-World Attacks [reqires Attacks, Transferability]
Adversarial examples in the physical worldVerification [requires Introduction]
Reluplex: An Efficient SMT Solver for Verifying Deep Neural NetworksDefenses (2) [requires Detecting]
Towards Deep Learning Models Resistant to Adversarial AttacksAttacks (2) [requires Defenses (2)]
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial ExamplesDefenses (3) [requires Attacks (2)]
Towards the first adversarially robust neural network model on MNISTOther Domains [requires Attacks]
Adversarial Attacks on Neural Network Policies