On the convergence and robustness of adversarial training
Version 2 2024-06-06, 10:41
Version 1 2019-01-01, 00:00
conference contribution
posted on 2024-06-06, 10:41 authored by Y Wang, X Ma, J Bailey, J Yi, B Zhou, Q Gu
Copyright © 2019 ASME
Improving the robustness of deep neural networks (DNNs) to adversarial examples is an important yet challenging problem for secure deep learning. Among existing defense techniques, adversarial training with Projected Gradient Descent (PGD) is one of the most effective. Adversarial training solves a min-max optimization problem: the inner maximization generates adversarial examples by maximizing the classification loss, and the outer minimization finds model parameters by minimizing the loss on the adversarial examples generated by the inner maximization. A criterion that measures how well the inner maximization is solved is therefore crucial for adversarial training. In this paper, we propose such a criterion, namely First-Order Stationary Condition for constrained optimization (FOSC), to quantitatively evaluate the convergence quality of adversarial examples found in the inner maximization. With FOSC, we find that to ensure better robustness, it is essential to use adversarial examples with better convergence quality at the later stages of training. Yet at the early stages, adversarial examples with high convergence quality are not necessary and may even lead to poor robustness. Based on these observations, we propose a dynamic training strategy that gradually increases the convergence quality of the generated adversarial examples, which significantly improves the robustness of adversarial training. Our theoretical and empirical results show the effectiveness of the proposed method.
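The abstract does not state the FOSC formula, so the following is only a minimal sketch under stated assumptions. It assumes an L-infinity perturbation ball of radius `eps` around the natural input, and it takes the first-order stationarity gap over that ball, max over feasible x of <x - x_adv, grad>, which has the closed form eps * ||grad||_1 - <x_adv - x_nat, grad>. A value of 0 means the inner maximization is solved to first order; larger values mean poorer convergence. The toy logistic model, the function names, and the linear threshold schedule are all hypothetical illustrations, not the authors' implementation.

```python
import numpy as np

def loss_grad(w, b, x, y):
    """Gradient w.r.t. x of the logistic loss log(1 + exp(-y * (w.x + b))), y in {-1, +1}."""
    margin = y * (np.dot(w, x) + b)
    return -y * w / (1.0 + np.exp(margin))

def fosc(x_adv, x_nat, grad, eps):
    """Stationarity gap over the L-inf ball of radius eps around x_nat (assumed form)."""
    return eps * np.abs(grad).sum() - np.dot(x_adv - x_nat, grad)

def pgd(w, b, x_nat, y, eps, step, n_steps):
    """Projected gradient ascent on the loss, projecting onto the L-inf ball."""
    x = x_nat.copy()
    for _ in range(n_steps):
        g = loss_grad(w, b, x, y)
        x = x + step * np.sign(g)                   # ascent step
        x = np.clip(x, x_nat - eps, x_nat + eps)    # projection
    return x

def fosc_threshold(epoch, total_epochs, c_max):
    """Hypothetical schedule: allow a loose gap early, require a tight one later."""
    return max(c_max * (1.0 - epoch / total_epochs), 0.0)

# Toy usage: the gap shrinks as PGD runs longer on a linear model.
w, b, y = np.array([1.0, -2.0, 0.5]), 0.0, 1.0
x_nat, eps = np.zeros(3), 0.1
for n_steps in (0, 1, 10):
    x_adv = pgd(w, b, x_nat, y, eps, step=eps / 4, n_steps=n_steps)
    print(n_steps, fosc(x_adv, x_nat, loss_grad(w, b, x_adv, y), eps))
```

In a dynamic strategy of the kind the abstract describes, the inner loop would stop as soon as the measured gap falls below the current epoch's threshold, so early epochs train on weaker adversarial examples and later epochs on better-converged ones.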
History
Location
Long Beach, California
Language
eng
Publication classification
E1.1 Full written paper - refereed
Volume
2019-June
Pagination
11426-11438
Start date
2019-06-09
End date
2019-06-15
ISBN-13
9781510886988
Title of proceedings
ICML 2019 : Proceedings of the 36th International Conference on Machine Learning
Event
Machine Learning. Conference (2019 : 36th : Long Beach, California)
Publisher
PMLR
Place of publication
[Long Beach, Calif.]

