Sensei

Fuzz Testing based Data Augmentation to Improve Robustness of Deep Neural Networks

Overview

Deep neural networks (DNN) have been shown to be notoriously brittle to small perturbations in their input data. This problem is analogous to the over-fitting problem in test-based program synthesis and automatic program repair, which is a consequence of the incomplete specification, i.e. the limited tests or training examples, that the program synthesis or repair algorithm has to learn from. Recently, test generation techniques have been successfully employed to augment existing specifications of intended program behavior, to improve the generalizability of program synthesis and repair. Inspired by these approaches, in this paper, we propose a technique that re-purposes software testing methods, specifically mutation-based fuzzing, to augment the training data of DNNs, with the objective of enhancing their robustness. Our technique casts the DNN data augmentation problem as an optimization problem. It uses genetic search to generate the most suitable variant of an input data to use for training the DNN, while simultaneously identifying opportunities to accelerate training by skipping augmentation in many instances. We instantiate this technique in two tools, Sensei and Sensei-SA, and evaluate them on 15 DNN models spanning 5 popular image data-sets. Our evaluation shows that Sensei can improve the robust accuracy of the DNN, compared to the state of the art, on each of the 15 models, by upto 11.9% and 5.5% on average. Further, Sensei-SA can reduce the average DNN training time by 25%, while still improving robust accuracy.

DataSet	Model	Standard	Random	W-10	Sensei
GTSRB	Model-1	3.20	77.60	85.80	90.80
	Model-2	2.40	70.80	84.60	86.30
	Model-3	0.70	72.10	83.30	88.70
	Model-4	1.60	72.80	82.40	86.90
FashionMnist	Model-1	0.20	65.70	79.20	83.60
	Model-2	0.30	60.20	72.20	78.00
	Model-3	0.20	63.10	73.90	77.40
CIFAR-10	Model-1	1.30	52.20	61.80	67.20
	Model-2	1.40	56.70	64.90	67.90
	Model-3	1.80	73.30	76.50	81.50
	Model-4	1.10	47.50	60.10	66.20
IMDB	Model-1	28.00	69.00	71.90	79.60
IMDB	Model-2	29.30	74.10	81.90	83.90
SVHN	Model-1	0.40	75.20	83.60	85.50
	Model-2	0.70	56.00	67.80	74.90

Sensei

Overview

Publication

Fuzz Testing based Data Augmentation to Improve Robustness of Deep Neural Networks

Experiments

People

Xiang Gao

Ripon K. Saha

Mukul R. Prasad

Abhik Roychoudhury