# MIT License
# Copyright (c) 2021 VinAI Research
# Permission is hereby granted, free of charge, to any person obtaining a copy
# of this software and associated documentation files (the "Software"), to deal
# in the Software without restriction, including without limitation the rights
# to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
# copies of the Software, and to permit persons to whom the Software is
# furnished to do so, subject to the following conditions:
# The above copyright notice and this permission notice shall be included in all
# copies or substantial portions of the Software.
# THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
# IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
# FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
# AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
# LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
# OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
# SOFTWARE.
'''
This file is modified based on the following source:
link : https://github.com/VinAIResearch/input-aware-backdoor-attack-release/tree/master/defenses
The defense method is called nc.
The update include:
1. data preprocess and dataset setting
2. model setting
3. args and config
4. save process
5. new standard: robust accuracy
6. implement finetune operation according to nc paper
basic structure for defense method:
1. basic setting: args
2. attack result(model, train data, test data)
3. nc defense:
a. initialize the model and trigger
b. train triggers according to different target labels
c. Determine whether the trained reverse trigger is a real backdoor trigger
If it is a real backdoor trigger:
d. select samples as clean samples and unlearning samples, finetune the origin model
4. test the result and get ASR, ACC, RA
'''
from defense.base import defense
class nc(defense):
r"""Neural Cleanse: Identifying And Mitigating Backdoor Attacks In Neural Networks
basic structure:
1. config args, save_path, fix random seed
2. load the backdoor attack data and backdoor test data
3. load the backdoor model
4. nc defense:
a. initialize the model and trigger
b. train triggers according to different target labels
c. Determine whether the trained reverse trigger is a real backdoor trigger
If it is a real backdoor trigger:
d. select samples as clean samples and unlearning samples, finetune the origin model
    5. test the result and get ASR, ACC, RA
.. code-block:: python
parser = argparse.ArgumentParser(description=sys.argv[0])
nc.add_arguments(parser)
args = parser.parse_args()
nc_method = nc(args)
if "result_file" not in args.__dict__:
args.result_file = 'one_epochs_debug_badnet_attack'
elif args.result_file is None:
args.result_file = 'one_epochs_debug_badnet_attack'
result = nc_method.defense(args.result_file)
.. Note::
@inproceedings{wang2019neural,
title={Neural cleanse: Identifying and mitigating backdoor attacks in neural networks},
author={Wang, Bolun and Yao, Yuanshun and Shan, Shawn and Li, Huiying and Viswanath, Bimal and Zheng, Haitao and Zhao, Ben Y},
booktitle={2019 IEEE Symposium on Security and Privacy (SP)},
pages={707--723},
year={2019},
organization={IEEE}}
Args:
        basic args: in the base class
ratio (float): the ratio of training data
index (str): the index of clean data
cleaning_ratio (float): the ratio of cleaning data used for finetuning the backdoor model
unlearning_ratio (float): the ratio of unlearning data (the clean data + the learned trigger) used for finetuning the backdoor model
nc_epoch (int): the epoch for neural cleanse to train the trigger
"""