Adversarial Examples that Fool both Computer Vision and Time-Limited Humans

Gamaleldin F. Elsayed, Shreya Shankar, Brian Cheung, Nicolas Papernot, Alexey Kurakin, Ian J. Goodfellow, Jascha Sohl-Dickstein

2018

Machine learning models are vulnerable to adversarial examples: small changes to images can cause computer vision models to make mistakes such as identifying a school bus as an ostrich. However, it is still an open question whether humans are prone to similar mistakes. Here, we address this question by leveraging recent techniques that transfer adversarial examples from computer vision models with known parameters and architecture to other models with unknown parameters and architecture, and by matching the initial processing of the human visual system. We find that adversarial examples that strongly transfer across computer vision models influence the classifications made by time-limited human observers.

Keywords:

Machine learning
Artificial intelligence
Computer science
Human visual system model
Adversarial system
Computer vision
Architecture
school bus
