SpecAttack: Specification-Based Adversarial Training for Deep Neural Networks.

2021 
Safety specification-based adversarial training aims to generate examples that violate a formal safety specification and thereby provides an avenue for repair. Maintaining high prediction accuracy while ensuring safe behavior remains challenging. We therefore present SpecAttack, a query-efficient counter-example generation and repair method for deep neural networks. SpecAttack lets users specify safety constraints on the model and finds inputs that violate these constraints. The violations are then used to repair the neural network via re-training such that it becomes provably safe. We evaluate SpecAttack's performance on the tasks of counter-example generation and repair. Our experimental evaluation demonstrates that SpecAttack is in most cases more query-efficient than comparable attacks and yields counter-examples of higher quality, while its repair technique is more efficient, maintains higher functional correctness, and provably guarantees compliance with the safety specification.
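To make the specify-attack-repair workflow concrete, below is a minimal sketch in PyTorch. Everything in it is an illustrative assumption rather than SpecAttack's actual algorithm: the toy model, the hypothetical box specification (model(x) >= 0 for all x in [0, 1]^2), the PGD-style gradient search for violations, and the penalty-based re-training. In particular, the sketch does not reproduce SpecAttack's query efficiency or its provable safety guarantee, which would require a formal verifier.

```python
# Illustrative sketch of a specification-based attack/repair loop (assumed
# setup, not SpecAttack's actual method).
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy model: maps 2-D inputs to a scalar score.
model = nn.Sequential(nn.Linear(2, 16), nn.ReLU(), nn.Linear(16, 1))

# Hypothetical safety specification: for all x in [0, 1]^2, model(x) >= 0.
lo, hi = 0.0, 1.0

def violation(x):
    """Positive wherever the spec is violated (output below 0)."""
    return torch.relu(-model(x))

def find_counter_examples(n=64, steps=100, lr=0.1):
    """PGD-style search: drive outputs down while staying in the input box."""
    x = torch.rand(n, 2) * (hi - lo) + lo
    x.requires_grad_(True)
    opt = torch.optim.Adam([x], lr=lr)
    for _ in range(steps):
        opt.zero_grad()
        loss = model(x).sum()  # minimizing the output pushes toward violation
        loss.backward()
        opt.step()
        with torch.no_grad():
            x.clamp_(lo, hi)  # project back into the specified input region
    with torch.no_grad():
        mask = violation(x).squeeze(-1) > 0
    return x.detach()[mask]

def repair(cex, epochs=200, lr=1e-2):
    """Re-train so counter-examples satisfy the spec with a small margin."""
    opt = torch.optim.Adam(model.parameters(), lr=lr)
    for _ in range(epochs):
        opt.zero_grad()
        loss = torch.relu(0.1 - model(cex)).mean()  # push outputs above 0
        loss.backward()
        opt.step()

cex = find_counter_examples()
print(f"found {len(cex)} violating inputs")
if len(cex) > 0:
    repair(cex)
    print(f"violations after repair: {int((violation(cex) > 0).sum())}")
```

In a full implementation, the repair step would also mix in the original training data to preserve functional correctness, and a verifier would certify that no violations remain anywhere in the specified region, not just at the found counter-examples.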