Using intuition from empirical properties to simplify adversarial training defense

Guanxiong Liu, Issa Khalil, Abdallah Khreishah

Research output: Chapter in Book/Report/Conference proceedingConference contribution

2 Scopus citations

Abstract

Due to the surprisingly good representation power of complex distributions, neural network (NN) classifiers are widely used in many tasks which include natural language processing, computer vision and cyber security. In recent works, people noticed the existence of adversarial examples. These adversarial examples break the NN classifiers' underlying assumption that the environment is attack free and can easily mislead fully trained NN classifier without noticeable changes. Among defensive methods, adversarial training is a popular choice. However, original adversarial training with single-step adversarial examples (Single-Adv) can not defend against iterative adversarial examples. Although adversarial training with iterative adversarial examples (Iter-Adv) can defend against iterative adversarial examples, it consumes too much computational power and hence is not scalable. In this paper, we analyze Iter-Adv techniques and identify two of their empirical properties. Based on these properties, we propose modifications which enhance Single-Adv to perform competitively as Iter-Adv. Through preliminary evaluation, we show that the proposed method enhances the test accuracy of state-of-the-art (SOTA) Single-Adv defensive method against iterative adversarial examples by up to 16.93% while reducing its training cost by 28.75%.

Original languageEnglish (US)
Title of host publicationProceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshop, DSN-W 2019
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages58-61
Number of pages4
ISBN (Electronic)9781728130309
DOIs
StatePublished - Jun 2019
Event49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshop, DSN-W 2019 - Portland, United States
Duration: Jun 24 2019Jun 27 2019

Publication series

NameProceedings - 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshop, DSN-W 2019

Conference

Conference49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshop, DSN-W 2019
Country/TerritoryUnited States
CityPortland
Period6/24/196/27/19

All Science Journal Classification (ASJC) codes

  • Artificial Intelligence
  • Information Systems
  • Safety, Risk, Reliability and Quality

Keywords

  • adversarial example
  • adversarial training
  • neural network classifier

Fingerprint

Dive into the research topics of 'Using intuition from empirical properties to simplify adversarial training defense'. Together they form a unique fingerprint.

Cite this