defense.nad
- class nad
Bases: defense
Neural Attention Distillation: Erasing Backdoor Triggers From Deep Neural Networks
basic structure:
configure args, save_path, and fix the random seed
load the backdoor attack data and backdoor test data
load the backdoored model
- nad defense:
create the student model, set the training parameters, and define the loss functions
train the student model under the guidance of the teacher model, aligning both the intermediate activations and the model outputs (see the AT-loss sketch after this list)
test the result and report ASR, ACC, RC
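The distillation in the step above relies on the attention transfer (AT) loss from the paper: per-layer activations are collapsed into spatial attention maps using the power p, normalized, and matched between student and teacher. Below is a minimal PyTorch sketch assuming (N, C, H, W) feature maps; the function names are illustrative, not this module's API.

import torch
import torch.nn.functional as F

def attention_map(activation: torch.Tensor, p: float = 2.0) -> torch.Tensor:
    # Collapse (N, C, H, W) activations into a spatial attention map by
    # averaging |activation|^p over the channel dimension, then L2-normalize
    # each flattened map so student and teacher maps share the same scale.
    a = activation.abs().pow(p).mean(dim=1)      # (N, H, W)
    return F.normalize(a.flatten(1), dim=1)      # (N, H*W)

def at_loss(student_act: torch.Tensor, teacher_act: torch.Tensor,
            p: float = 2.0) -> torch.Tensor:
    # Distillation term: mean squared distance between normalized attention maps.
    return (attention_map(student_act, p)
            - attention_map(teacher_act, p)).pow(2).mean()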
import argparse
import sys

from defense.nad import nad  # import path assumed from this module's name

parser = argparse.ArgumentParser(description=sys.argv[0])
nad.add_arguments(parser)
args = parser.parse_args()
nad_method = nad(args)
if "result_file" not in args.__dict__ or args.result_file is None:
    args.result_file = 'one_epochs_debug_badnet_attack'
result = nad_method.defense(args.result_file)
Note
@inproceedings{li2021neural,
  title={Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks},
  author={Li, Yige and Lyu, Xixiang and Koren, Nodens and Lyu, Lingjuan and Li, Bo and Ma, Xingjun},
  booktitle={International Conference on Learning Representations},
  year={2021}
}
- Parameters:
args (basic) – the basic arguments defined in the base class
ratio (float) – the ratio of training data used by the defense
index (str) – the index of the clean data
te_epochs (int) – the number of epochs for training the teacher model on the clean data
beta1 (int) – the weight of the AT loss at the first layer
beta2 (int) – the weight of the AT loss at the second layer
beta3 (int) – the weight of the AT loss at the third layer
p (float) – the power applied to model activations in the AT loss
teacher_model_loc (str) – the location of the teacher model (if None, the teacher model is trained)
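To show how beta1, beta2, beta3, and p enter training, here is a hedged sketch of one distillation step: cross-entropy on clean data plus one AT term per hooked layer, each weighted by its beta. It reuses at_loss from the sketch above; the nad_step name and the assumption that both models return three intermediate activations are illustrative, not this class's actual internals.

import torch
import torch.nn.functional as F

def nad_step(student, teacher, x, y, betas, p=2.0):
    # `student(x)` / `teacher(x)` are assumed to return
    # (logits, [act1, act2, act3]), e.g. collected via forward hooks
    # on three layer groups.
    logits, s_acts = student(x)
    with torch.no_grad():
        _, t_acts = teacher(x)
    loss = F.cross_entropy(logits, y)  # classification loss on clean data
    for beta, s_a, t_a in zip(betas, s_acts, t_acts):
        loss = loss + beta * at_loss(s_a, t_a, p)  # at_loss from the sketch above
    return loss

# usage:
# loss = nad_step(student, teacher, x, y,
#                 betas=(args.beta1, args.beta2, args.beta3), p=args.p)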